The invention relates to a method for producing fatty acid esters
which contain unsaturated fatty acids with at least three double
bonds, and free unsaturated fatty acids, at a content of at least
1% by weight in relation to the total amount of fatty acids
contained in plants, by expression of at least one nucleic acid
sequence which codes for a polypeptide having Δ6-desaturase
activity and at least one nucleic acid sequence which codes for a
polypeptide having Δ6-elongase activity. Advantageously, said
nucleic acid sequences can optionally be expressed in the
transgenic plant together with a third nucleic acid sequence
which codes for a polypeptide having Δ5-desaturase activity. The
invention also relates to the use of defined nucleic acid
sequences which code for polypeptides having a Δ6-desaturase
activity, Δ6-elongase activity or Δ5-desaturase activity,
selected from a group of nucleic acid sequences, or to the use of
nucleic acid constructs containing the above-mentioned nucleic
acid sequences.



(21) 2 485 060 

(22) 25.04.2003 






Office de la Propriété Intellectuelle du Canada
Un organisme d'Industrie Canada

Canadian Intellectual Property Office
An agency of Industry Canada

CA 2485060 A1 2003/11/13

(21) 2 485 060

(12) DEMANDE DE BREVET CANADIEN
     CANADIAN PATENT APPLICATION

(13) A1



(86) Date de dépôt PCT/PCT Filing Date: 2003/04/25

(87) Date publication PCT/PCT Publication Date: 2003/11/13

(85) Entrée phase nationale/National Entry: 2004/10/28

(86) N° demande PCT/PCT Application No.: EP 2003/004297

(87) N° publication PCT/PCT Publication No.: 2003/093482

(30) Priorité/Priority: 2002/04/29 (102 19 203.0) DE

(51) Cl.Int.7/Int.Cl.7 C12N 15/82, C12P 7/64

(71) Demandeur/Applicant:
BASF PLANT SCIENCE GMBH, DE

(72) Inventeurs/Inventors:
CIRPUS, PETRA, DE;
RENZ, ANDREAS, DE;
LERCHL, JENS, SE;
KUIJPERS, ANNE-MARIE, DE

(74) Agent: ROBIC

(54) Titre : PROCÉDÉ POUR PRODUIRE DES ACIDES GRAS POLYINSATURÉS DANS DES PLANTES
(54) Title: METHOD FOR PRODUCING MULTIPLE UNSATURATED FATTY ACIDS IN PLANTS



[Figure: Biosynthesis chain. Linoleic acid → (Δ6-desaturase) →
γ-linolenic acid → (Δ6-elongase) → dihomo-γ-linolenic acid →
(Δ5-desaturase) → arachidonic acid. Linolenic acid →
(Δ6-desaturase) → stearidonic acid → (Δ6-elongase) →
eicosatetraenoic acid → (Δ5-desaturase) → eicosapentaenoic acid.]



METHOD FOR PRODUCING MULTIPLE UNSATURATED FATTY ACIDS IN PLANTS

Description 

The present invention relates to a method for the production of
fatty acid esters which comprise unsaturated fatty acids with at
least three double bonds, and to free unsaturated fatty acids
with a content of at least 1% by weight based on the total fatty
acids present in the plants, by expressing at least one nucleic
acid sequence which encodes a polypeptide with Δ6-desaturase
activity and at least one nucleic acid sequence which encodes a
polypeptide with Δ6-elongase activity. Advantageously, these
nucleic acid sequences can, if appropriate, be expressed in the
transgenic plant together with a third nucleic acid sequence
which encodes a polypeptide with Δ5-desaturase activity.

The invention furthermore relates to the use of defined nucleic
acid sequences which encode polypeptides with a Δ6-desaturase
activity, Δ6-elongase activity or Δ5-desaturase activity selected
from a group of nucleic acid sequences, and/or to the use of
nucleic acid constructs comprising the abovementioned nucleic
acid sequences.

Certain products and by-products of naturally occurring metabolic
processes in microbial cells or in the cells of animals and,
advantageously, plants are useful for a wide range of
industries, including the feed, food, cosmetics and
pharmaceutical industries. These molecules, which are
collectively termed "fine chemicals", also include, for example,
lipids and fatty acids, one representative class of which are the
polyunsaturated fatty acids. Polyunsaturated fatty acids (PUFAs)
are added, for example, to infant formula to increase its
nutritional value. PUFAs have, for example, a positive effect on
the cholesterol level in the blood of humans and are therefore
useful for protection against heart disease. Fine chemicals such
as PUFAs can be isolated from animal sources such as, for
example, fish, or produced by culturing microorganisms which have
been developed such that they produce and accumulate or secrete
large amounts of one or more desired molecules.

0093/00064 CA 02485060 2004-10-29

Fatty acids and triglycerides have a multiplicity of uses in the
food industry, in animal nutrition, in cosmetics and in the
pharmacological sector. Depending on whether they take the form
of free saturated or unsaturated fatty acids or triglycerides
with an increased content of saturated or unsaturated fatty
acids, they are suitable for a variety of uses. Polyunsaturated
ω3-fatty acids and ω6-fatty acids constitute an important part of
animal and human nutrition. Owing to the present-day composition
of human nutrition, an addition of polyunsaturated ω3-fatty
acids, which are predominantly found in fish oils, to the food is
of particular importance. Thus, for example, polyunsaturated
fatty acids such as docosahexaenoic acid (= DHA,
C22:6 Δ4,7,10,13,16,19) or eicosapentaenoic acid (= EPA,
C20:5 Δ5,8,11,14,17) are added to baby formula to increase the
nutritional value. DHA is said to have a positive effect on brain
development.

The various acids and triglycerides are obtained mainly from
microorganisms such as Mortierella or from oil-producing plants
such as soybeans, oilseed rape, sunflower, algae such as
Crypthecodinium or Phaeodactylum and others, the products being
obtained, as a rule, in the form of their triacylglycerides
(= triglycerides = triglycerols). However, they can also be
obtained from animals such as, for example, fish. The free fatty
acids are advantageously prepared by hydrolysis. Higher
polyunsaturated fatty acids such as DHA, EPA, arachidonic acid
(= ARA, C20:4 Δ5,8,11,14), dihomo-γ-linolenic acid
(C20:3 Δ8,11,14) or docosapentaenoic acid (DPA,
C22:5 Δ7,10,13,16,19) cannot be isolated from oil crops such as
oilseed rape, soybeans, sunflower, safflower or others.
Conventional natural sources of these fatty acids are fish such
as herring, salmon, sardine, redfish, eel, carp, trout, halibut,
mackerel, zander or tuna, or algae.
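The shorthand used above (chain length, number of double bonds, and
the Δ positions counted from the carboxyl end) also fixes the ω class:
the last double bond sits at carbon number (chain length minus ω)
counted from the carboxyl end. The following minimal Python sketch
illustrates that arithmetic. The input format and the names FattyAcid
and parse_shorthand are assumptions made for this illustration and are
not part of the application.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class FattyAcid:
    carbons: int          # chain length, e.g. 20 for C20:4
    double_bonds: tuple   # Δ positions, counted from the carboxyl end

    @property
    def omega_class(self) -> int:
        """Position of the last double bond counted from the methyl end."""
        if not self.double_bonds:
            raise ValueError("a saturated fatty acid has no omega class")
        return self.carbons - max(self.double_bonds)


def parse_shorthand(text: str) -> FattyAcid:
    """Parse a string such as 'C20:4 Δ5,8,11,14' (hypothetical input format)."""
    head, _, tail = text.partition("Δ")
    carbons_str, count_str = head.strip().lstrip("Cc").split(":")
    positions = tuple(int(p) for p in tail.split(",") if p.strip())
    if len(positions) != int(count_str):
        raise ValueError(f"{text!r}: expected {count_str} double bonds")
    return FattyAcid(int(carbons_str), positions)


for name, code in [
    ("DHA", "C22:6 Δ4,7,10,13,16,19"),
    ("EPA", "C20:5 Δ5,8,11,14,17"),
    ("ARA", "C20:4 Δ5,8,11,14"),
]:
    acid = parse_shorthand(code)
    print(f"{name}: ω{acid.omega_class}")
```

With these inputs DHA and EPA come out as ω3 and ARA as ω6, which
matches the classification used in the text.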

Depending on the intended purpose, oils with saturated or with
unsaturated fatty acids are preferred; thus, for example, lipids
with unsaturated fatty acids, specifically polyunsaturated fatty
acids, are preferred in human nutrition. The polyunsaturated
ω3-fatty acids are said to have a positive effect on the
cholesterol level in the blood and thus on the possibility of
preventing heart disease. The risk of heart disease, stroke or
hypertension can be reduced markedly by adding these ω3-fatty
acids to the food. Also, ω3-fatty acids can have a positive
effect on inflammatory processes, specifically chronically
inflammatory processes in connection with immunological diseases
such as rheumatoid arthritis. These fatty acids are therefore
added to foodstuffs, specifically dietetic foodstuffs, or are
used in medicaments.

Owing to the usual composition of our foods, ω6-fatty acids such
as arachidonic acid tend to have a negative effect on these
rheumatic diseases.

ω3- and ω6-fatty acids are precursors of tissue hormones known as
eicosanoids, such as the prostaglandins, which are derived from
dihomo-γ-linolenic acid, arachidonic acid and eicosapentaenoic
acid, and the thromboxanes and the leukotrienes, which are
derived from arachidonic acid and eicosapentaenoic acid.
Eicosanoids (known as the PG2 series), which are formed from
ω6-fatty acids, promote, as a rule, inflammatory reactions, while
eicosanoids (known as the PG3 series) from ω3-fatty acids have a
minor, or no, proinflammatory action.

Owing to these positive properties, there has been no lack of
attempts in the past to make available genes which are involved
in the synthesis of fatty acids or triglycerides, for the
production, in various organisms, of oils with a modified content
of unsaturated fatty acids. Thus, WO 91/13972 and its US
equivalent describe a Δ9-desaturase. A Δ15-desaturase is claimed
in WO 93/11245 and a Δ12-desaturase is claimed in WO 94/11516.
Further desaturases are described, for example, in
EP-A-0 550 162, WO 94/18337, WO 97/30582, WO 97/21340,
WO 95/18222, EP-A-0 794 250, Stukey et al., J. Biol. Chem., 265,
1990: 20144-20149, Wada et al., Nature 347, 1990: 200-203
or Huang et al., Lipids 34, 1999: 649-659. However, the
biochemical characterization of the various desaturases is
incomplete as yet since the enzymes, being membrane-bound
proteins, can only be isolated and characterized with great
difficulty (McKeon et al., Methods in Enzymol. 71, 1981:
12141-12147, Wang et al., Plant Physiol. Biochem., 26, 1988:
777-792). As a rule, membrane-bound desaturases are characterized
by introduction into a suitable organism which is subsequently
analyzed for enzyme activity by means of analyses of the starting
material and the product. Δ6-Desaturases are described in
WO 93/06712, US 5,614,393, WO 96/21022, WO 00/21557 and
WO 99/27111, and their application for the production in
transgenic organisms has also been described, such as in
WO 98/46763, WO 98/46764 and WO 98/46765. In this context, the
expression of various desaturases is also described and claimed,
as is the case in WO 99/64616 or WO 98/46776, as is the formation
of polyunsaturated fatty acids. As regards the efficacy of the
expression of desaturases and their effect on the formation of
polyunsaturated fatty acids, it must be noted that only minor
contents of Δ6-unsaturated fatty acids/lipids, such as, for
example, gamma-linolenic acid and stearidonic acid, have been
obtained by expression of a single desaturase, as described to
date. Moreover, a mixture of ω3- and ω6-fatty acids has been
obtained as a rule, since all of the Δ6-desaturases described to
date converted, for example, not only linoleic acid (ω6-fatty
acid), but also α-linolenic acid (ω3-fatty acid).

Particularly suitable microorganisms for the production of PUFAs
are microorganisms such as Thraustochytrium species or
Schizochytrium species, algae such as Phaeodactylum tricornutum
or Crypthecodinium species, ciliates such as Stylonychia or
Colpidium, fungi such as Mortierella, Entomophthora or Mucor.
Strain selection has made possible the development of mutant
strains of the microorganisms in question which produce a series
of desirable compounds, including PUFAs. The mutation
and selection of strains with an improved production of a
particular molecule, such as the polyunsaturated fatty acids, is,
however, a time-consuming and difficult procedure. This is why
recombinant methods are preferred whenever possible, as described
above. However, only limited amounts of the desired
polyunsaturated fatty acids such as DPA, EPA or ARA can be
produced with the aid of the abovementioned microorganisms, these
unsaturated fatty acids being obtained, as a rule, as fatty acid
mixtures of, for example, EPA, DPA and DHA, depending on the
microorganism used.

As an alternative, the production of fine chemicals can suitably
be carried out on a large scale via the production in plants
which have been developed such that they produce the
abovementioned PUFAs. Plants which are particularly suited to
this purpose are oil crops, which comprise large amounts of lipid
compounds, such as oilseed rape, canola, linseed, soybeans,
sunflowers, borage and evening primrose. However, other crop
plants which comprise oils or lipids and fatty acids are also
well suited, as mentioned in the extensive description of the
present invention. Conventional breeding has given rise to a
series of mutant plants which produce a spectrum of desirable
lipids and fatty acids, cofactors and enzymes. However, the
selection of new plant varieties with improved production of a
particular molecule is a time-consuming and difficult procedure
or is indeed impossible if the compound does not occur naturally
in the plant in question, as in the case of polyunsaturated C18-,
C20- and C22-fatty acids and those with longer carbon chains.

Owing to the positive properties of unsaturated fatty acids,
there has been no lack of attempts in the past to make available
the genes which are involved in the synthesis of fatty acids or
triglycerides for the production, in various plants, of oils with
a modified content of polyunsaturated fatty acids. However, it
has been impossible as yet to produce longer-chain
polyunsaturated C20- and/or C22-fatty acids such as EPA or ARA in
plants.

It was therefore an object to develop a method for the production
of polyunsaturated fatty acid esters and/or free polyunsaturated
fatty acids with at least three double bonds in the fatty acid
molecule. This object was achieved by the method according to the
invention for the production of compounds of the general formula
I:

    R1-C(=O)-(CH2)n-(CH=CH-CH2)m-(CH2)p-CH3    (I)

in transgenic plants with a content of at least 1% by weight
based on the total fatty acids, which process comprises the
following steps:

a) introducing, into a plant, at least one nucleic acid sequence
   which encodes a polypeptide with a Δ6-desaturase activity;
   and

b) introducing at least one second nucleic acid sequence which
   encodes a polypeptide with a Δ6-elongase activity; and,

c) if appropriate, introducing a third nucleic acid sequence
   which encodes a polypeptide with a Δ5-desaturase activity;

d) followed by growing and harvesting the plants; and

where the variables and substituents in the formula I have the
following meanings:

R1 = -OH, coenzyme A (thioester), phosphatidylcholine,
phosphatidylethanolamine, phosphatidylglycerol,
diphosphatidylglycerol, phosphatidylserine,
phosphatidylinositol, sphingolipid, glycosphingolipid or a
radical of the following general formula II

    H2C-O-R2
    |
    HC-O-R3        (II)
    |
    H2C-O-

R2 = H, phosphatidylcholine-, phosphatidylethanolamine-,
phosphatidylglycerol-, diphosphatidylglycerol-,
phosphatidylserine-, phosphatidylinositol-, sphingolipid-,
glycosphingolipid- or saturated or unsaturated
C2-C24-alkylcarbonyl-,

R3 = H, saturated or unsaturated C2-C24-alkylcarbonyl-, or

R2 and R3 independently of one another represent a radical of the
general formula Ia

    -C(=O)-(CH2)n-(CH=CH-CH2)m-(CH2)p-CH3    (Ia)

n = 3, 4 or 6, m = 3, 4 or 5 and p = 0 or 3, preferably n = 3,
m = 4 or 5 and p = 0 or 3,



R1 in the compounds of the formula I denotes -OH (hydroxyl-),
acetyl-coenzyme A-, phosphatidylcholine-,
phosphatidylethanolamine-, phosphatidylglycerol-,
diphosphatidylglycerol-, phosphatidylserine-,
phosphatidylinositol-, sphingolipid-, glycosphingolipid- or a
radical of the following general formula II

    H2C-O-R2
    |
    HC-O-R3        (II)
    |
    H2C-O-



The abovementioned radicals for R1 are in each case bound to the
compounds of the formula I in the form of esters or thioesters.

R2 in the compounds of the formula II denotes hydrogen,
phosphatidylcholine-, phosphatidylethanolamine-,
phosphatidylglycerol-, diphosphatidylglycerol-,
phosphatidylserine-, phosphatidylinositol-, sphingolipid-,
glycosphingolipid- or saturated or unsaturated
C2-C24-alkylcarbonyl-.



Unsaturated or saturated C2-C22-alkylcarbonyl which may be
mentioned are radicals such as ethylcarbonyl, n-propylcarbonyl,
n-butylcarbonyl, n-pentylcarbonyl, n-hexylcarbonyl,
n-heptylcarbonyl, n-octylcarbonyl, n-nonylcarbonyl,
n-decylcarbonyl, n-undecylcarbonyl, n-dodecylcarbonyl,
n-tridecylcarbonyl, n-tetradecylcarbonyl, n-pentadecylcarbonyl,
n-hexadecylcarbonyl, n-heptadecylcarbonyl, n-octadecylcarbonyl,
n-nonadecylcarbonyl, n-eicosylcarbonyl, n-docosanylcarbonyl or
n-tetracosanylcarbonyl, all of which may comprise one or more
double bonds. Preferred are saturated or unsaturated
C10-C22-alkylcarbonyl radicals such as n-decylcarbonyl,
n-undecylcarbonyl, n-dodecylcarbonyl, n-tridecylcarbonyl,
n-tetradecylcarbonyl, n-pentadecylcarbonyl, n-hexadecylcarbonyl,
n-heptadecylcarbonyl, n-octadecylcarbonyl, n-nonadecylcarbonyl,
n-eicosylcarbonyl, n-docosanylcarbonyl or n-tetracosanylcarbonyl,
all of which comprise one or more double bonds. Especially
preferred are saturated or unsaturated C10-C22-alkylcarbonyl
radicals such as C10-alkylcarbonyl, C11-alkylcarbonyl,
C12-alkylcarbonyl, C13-alkylcarbonyl, C14-alkylcarbonyl,
C16-alkylcarbonyl, C18-alkylcarbonyl, C20-alkylcarbonyl,
C22-alkylcarbonyl or C24-alkylcarbonyl radicals, all of which
comprise one or more double bonds. Very especially preferred are
saturated or unsaturated C16-C22-alkylcarbonyl radicals such as
C16-alkylcarbonyl, C18-alkylcarbonyl, C20-alkylcarbonyl or
C22-alkylcarbonyl radicals, all of which comprise one or more
double bonds. Preferably, the abovementioned radicals comprise
two, three, four or five double bonds. Especially preferably, the
radicals comprise three, four or five double bonds. Very
especially preferred are C18-alkylcarbonyl radicals which comprise
one, two, three or four double bonds and C20-alkylcarbonyl
radicals which comprise three, four or five double bonds. All of
the abovementioned radicals are derived from the corresponding
fatty acids.

R3 denotes hydrogen or saturated or unsaturated
C2-C24-alkylcarbonyl.

R2 and R3 in the compounds of the formula II independently of one
another furthermore denote a radical of the general formula Ia,
where n = 3, 4 or 6, m = 3, 4 or 5 and p = 0 or 3, preferably
n = 3, m = 4 or 5 and p = 0 or 3.
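The indices n, m and p can be related to the usual fatty acid
shorthand. The minimal Python sketch below assumes that the acyl chain
of formula I consists of the carbonyl carbon, n CH2 groups, m
(CH=CH-CH2) units, p CH2 groups and a terminal CH3 group. This layout
is an assumption made for the illustration (the drawn formula is not
legible in this copy), chosen because it reproduces the Δ positions
quoted for ARA, EPA and GLA. The function name describe_chain is
hypothetical.

```python
def describe_chain(n: int, m: int, p: int) -> str:
    """Return the shorthand for an acyl chain with the given indices.

    Assumed layout: carbonyl C, n CH2 groups, m (CH=CH-CH2) units,
    p CH2 groups, terminal CH3.
    """
    carbons = 1 + n + 3 * m + p + 1
    # The first double bond starts right after the carbonyl C and the n CH2
    # groups; each further unit shifts the position by three carbons.
    positions = [n + 2 + 3 * i for i in range(m)]
    return f"C{carbons}:{m} Δ" + ",".join(str(pos) for pos in positions)


print(describe_chain(3, 4, 3))  # preferred values n = 3, m = 4, p = 3
print(describe_chain(3, 5, 0))  # preferred values n = 3, m = 5, p = 0
print(describe_chain(4, 3, 3))  # n = 4, m = 3, p = 3
```

Under this assumption the preferred values n = 3 with m = 4, p = 3 and
with m = 5, p = 0 give C20:4 Δ5,8,11,14 and C20:5 Δ5,8,11,14,17, the
shorthand quoted for ARA and EPA, and n = 4, m = 3, p = 3 gives
C18:3 Δ6,9,12, the shorthand for GLA.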




The abovementioned radicals R1, R2 and R3 may also have attached
to them substituents such as hydroxyl or epoxy groups or else
comprise triple bonds.

The nucleic acid sequences used in the method according to the
invention are isolated nucleic acid sequences which encode
polypeptides with Δ5-, Δ6-desaturase or Δ6-elongase activity.

The compounds of the formula I which are produced in this method
advantageously comprise a mixture of differing radicals R1, R2 or
R3 which can be derived from differing glycerides. Moreover, the
abovementioned radicals can be derived from different fatty acids
such as short-chain fatty acids having 4 to 6 carbon atoms,
medium-chain fatty acids having 8 to 12 carbon atoms or
long-chain fatty acids having 14 to 24 carbon atoms; the
long-chain fatty acids are preferred.

The method according to the invention advantageously gives fatty
acid esters (= compounds of the formula I) with polyunsaturated
C18-, C20- and/or C22-fatty acid molecules with at least two
double bonds in the fatty acid ester. Preferably, these fatty
acid molecules comprise three, four or five double bonds and
advantageously lead to the synthesis of γ-linolenic acid (= GLA,
C18:3 Δ6,9,12), stearidonic acid (= SDA, C18:4 Δ6,9,12,15),
dihomo-γ-linolenic acid (= DGLA, C20:3 Δ8,11,14),
eicosatetraenoic acid (= ETA, C20:4 Δ8,11,14,17), arachidonic
acid (ARA), eicosapentaenoic acid (EPA) or their mixtures,
preferably EPA and/or ARA.

The fatty acid esters with polyunsaturated C18-, C20- and/or
C22-fatty acid molecules can be isolated from the organisms which
have been used for the production of the fatty acid esters in the
form of an oil or lipid, for example in the form of compounds
such as sphingolipids, phosphoglycerides, lipids, glycolipids
such as glycosphingolipid, phospholipids such as
phosphatidylethanolamine, phosphatidylcholine, phosphatidylserine,
phosphatidylglycerol, phosphatidylinositol or
diphosphatidylglycerol, monoacylglycerides, diacylglycerides,
triacylglycerides or other fatty acid esters such as the
acetyl-coenzyme A esters which comprise the polyunsaturated fatty
acids having at least two, preferably three, double bonds. In
addition to these esters, the polyunsaturated fatty acids are
also present in the plants as free fatty acids or bound in other
compounds. As a rule, the different abovementioned compounds
(fatty acid esters and free fatty acids) are present in the plant
in an approximate distribution of 80 to 90% by weight of
triglycerides, 2 to 5% by weight of diglycerides, 5 to 10% by
weight of monoglycerides, 1 to 5% by weight of free fatty acids
and 2 to 8% by weight of phospholipids, the total of the different
compounds making 100% by weight.
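The distribution above combines per-class ranges with the condition
that all classes add up to 100% by weight. The minimal Python sketch
below shows one way to check a measured lipid profile against both
conditions. The function name check_distribution and the example
profile are hypothetical and are not taken from the application.

```python
# Approximate ranges in % by weight, as quoted in the paragraph above.
LIPID_CLASS_RANGES = {
    "triglycerides": (80.0, 90.0),
    "diglycerides": (2.0, 5.0),
    "monoglycerides": (5.0, 10.0),
    "free fatty acids": (1.0, 5.0),
    "phospholipids": (2.0, 8.0),
}


def check_distribution(shares, tolerance=1e-6):
    """Return a list of problems; an empty list means the profile fits."""
    problems = []
    total = sum(shares.values())
    if abs(total - 100.0) > tolerance:
        problems.append(f"shares add up to {total:g}%, not 100%")
    for lipid_class, (low, high) in LIPID_CLASS_RANGES.items():
        value = shares.get(lipid_class, 0.0)
        if not low <= value <= high:
            problems.append(
                f"{lipid_class}: {value:g}% outside {low:g}-{high:g}%"
            )
    return problems


# Hypothetical profile, invented for illustration only.
example = {
    "triglycerides": 85.0,
    "diglycerides": 3.0,
    "monoglycerides": 6.0,
    "free fatty acids": 2.0,
    "phospholipids": 4.0,
}
print(check_distribution(example))
```

The upper limits of all five ranges add up to 118% by weight, so a
profile cannot sit at the top of every range at once; the sum check
catches that case.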

When the compounds of the general formula I are produced in the
method according to the invention, they are produced in a content
of at least 1% by weight, advantageously at least 2% by weight,
preferably at least 3% by weight, especially preferably at least
5% by weight, very especially preferably at least 10% by weight
based on the total of the fatty acids in the transgenic plant.
Since, in the method according to the invention, the starting
compounds linoleic acid (C18:2) and/or linolenic acid (C18:3)
undergo several reaction steps, the end products of the method,
such as, for example, arachidonic acid (ARA) or eicosapentaenoic
acid (EPA) are not obtained as pure products, but there are
always minor amounts of the precursors still present in the end
product. If both linoleic acid and linolenic acid are present in
the original plant, the end products such as ARA and EPA are
present as mixtures. The precursors should advantageously not
amount to more than 20% by weight, preferably not more than 15%
by weight, especially preferably not more than 10% by weight,
very especially preferably not more than 5% by weight, based on
the amount of the end product in question. Advantageously, the
end products which are produced in the method according to the
invention in a transgenic plant are only ARA or only EPA, either
bound or as free acids (see compounds of the general formula I).
If both compounds (ARA + EPA) are produced simultaneously, they
are advantageously produced in a ratio of at least 1:2 (EPA:ARA),
advantageously at least 1:3, preferably 1:4, especially
preferably 1:5.
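The two figures in this paragraph use different bases: the product
content is stated relative to the total fatty acids, while the
precursor limit is stated relative to the amount of the end product.
The minimal Python sketch below keeps the two apart. It reflects one
reading of the paragraph, and the names content_tier, precursor_share
and within_precursor_limit are hypothetical.

```python
# (minimum % by weight of total fatty acids, wording used in the text)
CONTENT_TIERS = [
    (10.0, "very especially preferred"),
    (5.0, "especially preferred"),
    (3.0, "preferred"),
    (2.0, "advantageous"),
    (1.0, "minimum"),
]


def content_tier(product_pct):
    """Return the highest tier reached, or None below the 1% minimum."""
    for threshold, label in CONTENT_TIERS:
        if product_pct >= threshold:
            return label
    return None


def precursor_share(product_pct, precursor_pct):
    """Precursors as % by weight of the end product, not of total fatty acids.

    Both arguments are given in % by weight of the total fatty acids.
    """
    if product_pct <= 0:
        raise ValueError("end product content must be positive")
    return 100.0 * precursor_pct / product_pct


def within_precursor_limit(product_pct, precursor_pct, limit_pct=20.0):
    """Check the precursor share against a limit (20% is the widest one)."""
    return precursor_share(product_pct, precursor_pct) <= limit_pct


# Hypothetical plant: 5% end product and 0.5% precursors in total fatty acids.
print(content_tier(5.0), precursor_share(5.0, 0.5))
```

In this hypothetical case the end product reaches the "especially
preferred" content tier, and the precursors amount to 10% by weight of
the end product.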

Suitable organisms for the production in the method according to
the invention are, in principle, all plants such as mosses,
algae, dicots or monocots. It is advantageous to use, in the
method according to the invention, organisms which belong to the
oil-producing organisms, i.e. which are used for the production
of oils, such as algae like Crypthecodinium or Phaeodactylum, or
plants, preferably oil crops, which comprise large amounts of
lipid compounds, such as peanut, oilseed rape, canola, sunflower,
safflower, poppy, mustard, hemp, castor-oil plant, olive, sesame,
Calendula, Punica, evening primrose, verbascum, thistle, wild
roses, hazelnut, almond, macadamia, avocado, bay, pumpkin/squash,
linseed, soybean, pistachios, borage, trees (oil palm, coconut or
walnut) or field crops such as maize, wheat, rye, oats,
triticale, rice, barley, cotton, cassava, pepper, Tagetes,
Solanaceae plants such as potato, tobacco, eggplant and tomato,
Vicia species, pea, alfalfa or bush plants (coffee, cacao, tea),
Salix species and perennial grasses and fodder crops. Preferred
plants according to the invention are oil crops such as peanut,
oilseed rape, canola, sunflower, safflower, pea, mustard, hemp,
castor-oil plants, olive, Calendula, Punica, evening primrose,
pumpkin/squash, linseed, soybean, borage, trees (oil palm,
coconut). Especially preferred are plants which are high in
C18:2- and/or C18:3-fatty acid, such as sunflower, safflower,
tobacco, verbascum, sesame, cotton, pumpkin/squash, poppy,
evening primrose, walnut, linseed, hemp or thistle. Very
especially preferred are plants such as safflower, sunflower,
poppy, evening primrose, walnut, linseed or hemp.

Owing to the enzymatic activity of the polypeptides with Δ5-,
Δ6-desaturase or Δ6-elongase activity which are encoded by the
nucleic acids used in the method according to the invention,
different compounds of the formula I can be produced. Depending
on the choice of the plant used for the method according to the
invention, mixtures of the different compounds of the general
formula I or individual compounds, such as EPA or ARA, can be
produced in free or bound form. Depending on the fatty acid
composition which prevails in the original plant (C18:2- or
C18:3-fatty acids), this gives compounds of the general formula I
which are derived from C18:2-fatty acids, such as GLA-, DGLA- or
ARA-comprising compounds of the formula I, or compounds which are
derived from C18:3-fatty acids, such as SDA-, ETA- or
EPA-comprising compounds of the formula I. If linoleic acid
(= LA, C18:2 Δ9,12) is the only unsaturated fatty acid present in
the plant used for the method, only GLA, DGLA and ARA can be
formed as products of the method, all of which can be present as
free fatty acids or in bound form. If α-linolenic acid (= ALA,
C18:3 Δ9,12,15) is the only unsaturated fatty acid present in the
plant used in the method, for example in linseed, only SDA, ETA
and EPA can be formed as products of the method, all of which can
be present as free fatty acids or in bound form, as described
above. By modifying the activity of the enzymes implicated in the
synthesis (Δ5-, Δ6-desaturase and Δ6-elongase), or by introducing
only the first two genes (Δ6-desaturase and Δ6-elongase) of the
synthetic cascade, it is possible to produce in a targeted manner
only individual products in the abovementioned plants (see
Figure I). Due to the activity of the enzymes Δ6-desaturase and
Δ6-elongase, GLA and DGLA, or SDA and ETA, respectively, form,
depending on the original plant and the unsaturated fatty acid.
DGLA or ETA, respectively, or mixtures of these are formed
preferentially. If the enzyme Δ5-desaturase is additionally
introduced into the plant, ARA or EPA are additionally formed. It
is advantageous only to synthesize ARA or EPA or their mixtures,
depending on the fatty acid which is present in the plant and
which acts as starting material for the synthesis. Since
biosynthetic cascades are involved, the end products in question
are not present in pure form in the plants. There are always
minor amounts of the precursor compounds present in the end
product. These minor amounts amount to less than 20% by weight,
advantageously less than 15% by weight, especially advantageously
less than 10% by weight, very especially advantageously less than
5, 4, 3, 2 or 1% by weight, based on the end product DGLA, ETA or
their mixtures, or ARA, EPA or their mixtures, respectively.
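The dependence of the products on the starting fatty acid and on the
enzymes introduced can be written down as a small lookup. The minimal
Python sketch below encodes only the conversions named in the
paragraph above; it is an illustration and not a model of enzyme
kinetics or yields. The names PATHWAY, ENZYME_ORDER and
reachable_products are hypothetical.

```python
# Each enzyme maps a substrate to its product, following the two
# branches described in the text (LA branch and ALA branch).
PATHWAY = {
    "Δ6-desaturase": {"LA": "GLA", "ALA": "SDA"},
    "Δ6-elongase": {"GLA": "DGLA", "SDA": "ETA"},
    "Δ5-desaturase": {"DGLA": "ARA", "ETA": "EPA"},
}
ENZYME_ORDER = ("Δ6-desaturase", "Δ6-elongase", "Δ5-desaturase")


def reachable_products(starting_acids, enzymes):
    """List the fatty acids that can form from starting_acids.

    starting_acids: fatty acids present in the original plant.
    enzymes: names of the enzymes introduced into the plant.
    """
    pool = set(starting_acids)
    products = []
    for enzyme in ENZYME_ORDER:
        if enzyme not in enzymes:
            continue  # this step is not available in the plant
        for substrate, product in PATHWAY[enzyme].items():
            if substrate in pool and product not in pool:
                pool.add(product)
                products.append(product)
    return products


print(reachable_products({"LA"}, set(ENZYME_ORDER)))
print(reachable_products({"ALA"}, {"Δ6-desaturase", "Δ6-elongase"}))
```

With linoleic acid alone and all three enzymes the sketch yields GLA,
DGLA and ARA; with α-linolenic acid and only the first two enzymes it
stops at SDA and ETA, as the text describes.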

For the purposes of the method according to the invention,
transgenic plants are also understood as meaning plant cells,
plant organs or intact plants which are grown for the production
of compounds of the general formula I. Growing is understood as
meaning, for example, culturing of the transgenic plant cells,
plant tissue or plant organs on a nutrient medium or the intact
plant on or in a substrate, for example in hydroponic culture or
on an arable soil.

Nucleic acids which can be used in the method according to the
invention are, in principle, all those which encode polypeptides
with Δ5-, Δ6-desaturase or Δ6-elongase activity. These nucleic
acids are advantageously derived from plants such as algae, such
as Isochrysis or Crypthecodinium, diatoms such as Phaeodactylum,
mosses such as Physcomitrella, Ceratodon or higher plants such as
the Primulaceae, such as Aleuritia, Calendula stellata,
Osteospermum spinescens or Osteospermum hyoseroides,
microorganisms such as fungi, such as Aspergillus,
Thraustochytrium, Phytophthora, Entomophthora, Mucor or
Mortierella, yeasts or animals such as nematodes, such as
Caenorhabditis, insects or humans. The Δ5-, Δ6-desaturase or
Δ6-elongase genes are advantageously derived from fungi or from
plants such as algae or mosses, preferably from plants.

It is advantageous to use, in the method according to the
invention, a nucleic acid sequence selected from the group of the
sequences shown in SEQ ID NO: 1, 3, 5, 7, 9, 11, 13, 15, 17, 19,
21, 23, 25, 27, 29 or 31, or their derivatives or homologs which
encode polypeptides which retain the enzymatic activity. These
sequences, individually or in combination, are cloned into
expression constructs; these expression constructs are
represented in the sequences SEQ ID NO: 33-37. These expression
constructs make possible an optimal synthesis of the compounds of
the general formula I produced in the method according to the
invention.

In a preferred embodiment, the method furthermore comprises the
step of obtaining a cell which comprises the nucleic acid
sequences which are used in the method and which encode a Δ5- or
Δ6-desaturase and a Δ6-elongase, where a cell is transformed with
the nucleic acid sequence, a gene construct or a vector which
brings about the expression of the Δ5-desaturase, Δ6-desaturase
or Δ6-elongase nucleic acid, alone or in combination. In a
further preferred embodiment, the method furthermore comprises
the step of obtaining the fine chemical from the culture. The
cell generated thus is advantageously a cell of an oil crop such
as, for example, peanut, oilseed rape, canola, linseed, soybean,
safflower, hemp, sunflowers or borage.

A transgenic plant is understood as meaning, for the purposes of
the invention, that the nucleic acids used in the method are not
at their natural locus in the genome of an organism; in this
context, the nucleic acids can be expressed homologously or
heterologously. However, transgenic also means that, while the
nucleic acids according to the invention are at their natural
locus in the genome of an organism, the sequence has been
modified in comparison with the natural sequence and/or the
regulatory sequences of the natural sequences have been modified.
Preferably, transgenic is understood as meaning that the nucleic
acids according to the invention are not expressed at their
natural locus in the genome, that is to say that homologous or
preferably heterologous expression of the nucleic acids takes
place. Preferred transgenic plants are the oil crops.
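
The two-part definition above can be read as a simple predicate.
The following sketch is a hypothetical illustration; the function
and parameter names are assumptions and do not appear in the
application.

```python
def is_transgenic(at_natural_locus, sequence_modified, regulation_modified):
    """Apply the definition above to one nucleic acid used in the method."""
    if not at_natural_locus:
        # First case: the nucleic acid is not at its natural locus.
        return True
    # Second case: at the natural locus, the sequence and/or its
    # regulatory sequences must have been modified.
    return sequence_modified or regulation_modified
```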

Transgenic plants which comprise the compounds of the formula I
which have been synthesized in the method according to the
invention can be marketed directly without isolation of the
compounds which have been synthesized. Plants are understood as
meaning, in the method according to the invention, all plant
parts, plant organs such as leaf, stem, root, tuber or seeds, or
all of the plant. In this context, the seed comprises all parts
of the seed such as the seed coats, epidermis cells and seed
cells, endosperm or embryo tissue. However, the compounds
produced in the method according to the invention can also be
isolated from the plants in the form of their oils, fats, lipids
and/or free fatty acids. Compounds of the formula I which have
been produced by this method can be harvested by harvesting the
organisms either from the culture in which they grow or from the
field. This can be done by pressing or extracting the plant
parts, preferably the plant seeds. In this context, the oils,
fats, lipids and/or free fatty acids can be obtained by pressing
by what is known as cold-beating or cold-pressing, without
supplying heat. The plant parts, specifically the seeds, are
first comminuted, steam-treated or toasted in order to
facilitate their disruption. The seeds pretreated thus can
subsequently be pressed or else extracted with solvents such as
warm hexane. The solvent is subsequently removed. In this manner,
more than 96% of the compounds produced in the method can be
isolated. The resulting products are subsequently processed
further, i.e. refined. Here, the plant mucilages and turbid
matter are removed first. What is known as degumming can be
performed enzymatically or, for example, chemico-physically by
adding acid such as phosphoric acid. The free fatty acids are
subsequently removed by treatment with a base, for example sodium
hydroxide solution. The resulting product is washed thoroughly
with water to remove the alkali remaining in the product, and
dried. To remove the coloring matter which still remains in the
product, the products are bleached, for example using bleaching
earth or active charcoal. At the end, the product is deodorized,
for example by using steam.

The PUFAs produced by this method are preferentially C18- or
C20-22-fatty acid molecules having at least two double bonds in
the fatty acid molecule, preferably three or four or, in
combination with a further elongase and a Δ4-desaturase, five or
six double bonds. These C18- or C20-22-fatty acid molecules can
be isolated from the organism in the form of an oil, lipid or a
free fatty acid. Suitable organisms are, for example, those which
have been mentioned above. Preferred organisms are transgenic
plants.

In a preferred embodiment, oils, lipids or fatty acids or
fractions of these which have been produced by the
above-described method are especially preferably an oil, lipid
or fatty acid composition which comprises PUFAs or which
originates from transgenic plants.

A further embodiment according to the invention is the use of the
oil, lipid or the fatty acid composition in foods, feeds,
cosmetics or pharmaceuticals.

The term "oil" or "fat" is understood as meaning a fatty acid
mixture which comprises unsaturated, saturated, preferably
esterified fatty acid(s). It is preferred that the oil or fat has
a high content of unsaturated, unconjugated esterified fatty
acid(s), in particular linoleic acid, γ-linolenic acid,
dihomo-γ-linolenic acid, arachidonic acid, α-linolenic acid,
stearidonic acid, eicosatetraenoic acid or eicosapentaenoic acid.
The amount of unsaturated esterified fatty acids is preferably
approximately 30%, with an amount of 50% being more preferred and
an amount of 60%, 70%, 80% or more being even more preferred. For
identification purposes, it is possible, for example, to
determine the amount of fatty acid by gas chromatography after
converting the fatty acids into the methyl esters by means of
transesterification. The oil or fat can comprise various other
saturated or unsaturated fatty acids, for example calendulic
acid, palmitic acid, stearic acid, oleic acid and the like. The
amount of the various fatty acids in oil or fat can vary in
particular as a function of the original plant.
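
The gas-chromatographic determination mentioned above can be
illustrated by a short calculation. The sketch assumes that the
peak areas of the fatty acid methyl esters are proportional to
their amounts (equal detector response); this simplification, the
function names and the example values are assumptions made here
and are not taken from the application.

```python
def fatty_acid_percentages(peak_areas):
    """Convert methyl ester peak areas into % of total fatty acids."""
    total = sum(peak_areas.values())
    if total <= 0:
        raise ValueError("total peak area must be positive")
    return {name: 100.0 * area / total for name, area in peak_areas.items()}


def unsaturated_share(peak_areas, unsaturated_names):
    """Sum the percentages of the fatty acids listed as unsaturated."""
    shares = fatty_acid_percentages(peak_areas)
    return sum(value for name, value in shares.items()
               if name in unsaturated_names)


# Hypothetical peak areas, for illustration only.
example_areas = {"palmitic acid": 10.0, "oleic acid": 30.0,
                 "linoleic acid": 60.0}
example_share = unsaturated_share(example_areas,
                                  {"oleic acid", "linoleic acid"})
```

The resulting share can be compared with the preferred amounts of
approximately 30%, 50%, or 60% and more named above.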

The compounds of the formula I which are produced in the method
and which comprise polyunsaturated fatty acids having at least
two double bonds are sphingolipids, phosphoglycerides, lipids,
glycolipids, phospholipids, monoacylglycerol, diacylglycerol,
triacylglycerol or other fatty acid esters.

The polyunsaturated fatty acids which are present can be
liberated from the compounds of the general formula I produced
thus in the method according to the invention, for example via
treatment with alkali, for example aqueous KOH or NaOH, or acid
hydrolysis, advantageously in the presence of an alcohol such as
methanol or ethanol, or via enzymatic cleavage, and isolated via,
for example, phase separation and subsequent acidification with,
for example, H2SO4. However, the fatty acids can also be liberated
directly without the above-described processing.

After they have been introduced into plant cells or plants, the
nucleic acids used in the method can either be located on a
separate plasmid or integrated into the genome of the host cell.
In the case of integration into the genome, the integration can
be random or be effected by recombination in such a way that the
native gene is replaced by the copy being introduced, whereby the
production of the desired compound by the cell is modulated, or
by using a gene in trans, so that the gene is linked operably
with a functional expression unit which comprises at least one
sequence which ensures the expression of a gene and at least one
sequence which ensures the polyadenylation of a functionally
transcribed gene. The nucleic acids are advantageously introduced
into the plants via multiexpression cassettes or constructs for
the multiparallel seed-specific expression of genes.

Mosses and algae are the only known plant systems which produce
substantial amounts of polyunsaturated fatty acids such as
arachidonic acid (ARA) and/or eicosapentaenoic acid (EPA) and/or
docosahexaenoic acid (DHA). Mosses comprise PUFAs in membrane
lipids, while algae, organisms which are related to algae and
some fungi also accumulate substantial amounts of PUFAs in the
triacylglycerol fraction. This is why nucleic acid molecules
which are isolated from such strains which also accumulate PUFAs
in the triacylglycerol fraction are especially advantageously
suitable for the method according to the invention and thus for
the modification of the lipid and PUFA production system in a
host, in particular plants, such as oil crops, for example
oilseed rape, canola, linseed, hemp, soybean, sunflowers, borage.
They can therefore be used advantageously in the method according
to the invention.

It has been possible to date to demonstrate that a trienoic acid
with a C18 carbon chain can be produced with the aid of
desaturases. These methods, which are known from the literature,
claim the production of γ-linolenic acid. However, nobody has as
yet been able to demonstrate the production of very long-chain
polyunsaturated fatty acids (with C20 and longer carbon chains
and of trienoic and more highly unsaturated types) by modified
plants alone.

To produce the longer-chain PUFAs according to the invention, the
polyunsaturated C18-fatty acids must first be desaturated by the
enzymatic activity of a desaturase and subsequently elongated by
at least two carbon atoms via an elongase. After one elongation
cycle, this enzyme activity gives C20-fatty acids, and after two
or three elongation cycles C22- or C24-fatty acids. The activity
of the desaturases and elongases used in the method according to
the invention gives by preference C18-, C20- and/or C22-fatty
acids having at least two double bonds in the fatty acid
molecule, by preference three, four or five double bonds,
especially preferably C18- and/or C20-fatty acids with at least
two double bonds in the fatty acid molecule, preferably with
three, four or five double bonds in the molecule. After a first
desaturation and the elongation have taken place, further
desaturation steps such as, for example, in the Δ5-position, may
take place. Especially preferred products of the process
according to the invention are arachidonic acid and
eicosapentaenoic acid. The C18-fatty acids with at least two
double bonds in the fatty acid can be elongated by the enzymatic
activity according to the invention in the form of the free fatty
acid or in the form of the esters, such as phospholipids,
glycolipids, sphingolipids, phosphoglycerides, monoacylglycerol,
diacylglycerol or triacylglycerol.
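
The sequence of desaturation and elongation described above can
be followed with simple bookkeeping of carbon atoms and double
bonds. The sketch below is an illustration under the assumption
that each desaturation adds exactly one double bond and each
elongation cycle adds exactly two carbon atoms; the class and
function names are hypothetical.

```python
from typing import NamedTuple


class FattyAcid(NamedTuple):
    carbons: int
    double_bonds: int


def desaturate(fatty_acid):
    """A desaturase introduces one additional double bond."""
    return FattyAcid(fatty_acid.carbons, fatty_acid.double_bonds + 1)


def elongate(fatty_acid, cycles=1):
    """Each elongation cycle adds two carbon atoms."""
    return FattyAcid(fatty_acid.carbons + 2 * cycles,
                     fatty_acid.double_bonds)


# Linoleic acid (C18, two double bonds) -> desaturation -> elongation
# -> further desaturation gives a C20 acid with four double bonds,
# which corresponds to arachidonic acid.
linoleic = FattyAcid(18, 2)
arachidonic = desaturate(elongate(desaturate(linoleic)))
```

Starting from a C18 acid, elongate(..., cycles=2) and
elongate(..., cycles=3) give the C22 and C24 chain lengths
mentioned in the text.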

By using cloning vectors in plants and in the transformation of
plants, such as those which are published and cited in: Plant
Molecular Biology and Biotechnology (CRC Press, Boca Raton,
Florida), Chapter 6/7, pp. 71-119 (1993); F.F. White, Vectors for
Gene Transfer in Higher Plants, in: Transgenic Plants, Vol. 1,
Engineering and Utilization, Eds.: Kung and R. Wu, Academic
Press, 1993, 15-38; B. Jenes et al., Techniques for Gene
Transfer, in: Transgenic Plants, Vol. 1, Engineering and
Utilization, Eds.: Kung and R. Wu, Academic Press (1993),
128-143; Potrykus, Annu. Rev. Plant Physiol. Plant Molec. Biol.
42 (1991), 205-225, the nucleic acids can be used for the
recombinant modification of a broad spectrum of plants so that
the plant becomes a better or more efficient producer of one or
more lipid-derived products, such as PUFAs. This improved
production or production efficiency of a lipid-derived product,
such as PUFAs, can be brought about by a direct action of the
manipulation or an indirect action of this manipulation.

A series of mechanisms exists by means of which the modification
of a desaturase protein according to the invention can have a
direct effect on the yield, production and/or production
efficiency of a fine chemical from an oil crop plant or a
microorganism, owing to a modified protein. The number or
activity of the desaturase protein or desaturase gene and of gene
combinations of desaturases and elongases can be increased, so
that larger amounts of these compounds are produced de novo since
the organisms lacked this activity and ability to biosynthesize
them prior to introduction of the gene in question. This also
applies analogously to the combination with further desaturases
or elongases or further enzymes of the lipid metabolism. The use
of various divergent sequences, i.e. sequences which differ at
the DNA sequence level, may also be advantageous, or else the use
of promoters for gene expression which make possible a different
temporal gene expression, for example as a function of the degree
of maturity of the seed or oil-storing tissue.

The introduction of a desaturase and/or elongase gene, or several
desaturase and elongase genes, into an organism or a cell, alone
or in combination with other genes, can not only increase the
biosynthesis flux toward the end product, but also increase, or
generate de novo, the corresponding triacylglycerol composition.
Likewise, the number or activity of other genes which participate
in the import of nutrients required for the biosynthesis of one
or more fine chemicals (for example fatty acids, polar and
neutral lipids) can be increased, so that the concentration of
these precursors, cofactors or intermediates within the cells or
within the storage compartment is increased, thus further
increasing the ability of the cells to produce PUFAs as described
hereinbelow. Fatty acids and lipids themselves are desirable as
fine chemicals; by optimizing the activity or increasing the
number of one or more desaturases and/or elongases which
participate in the biosynthesis of these compounds, or by
destroying the activity of one or more desaturases which
participate in the breakdown of these compounds, it can be
possible to increase the yield, production and/or efficiency of
the production of fatty acid and lipid molecules from plants.

The isolated nucleic acid molecules used in the process according
to the invention encode proteins or parts of these, the proteins,
or the individual protein or parts thereof, comprising an amino
acid sequence with sufficient homology with an amino acid
sequence of the sequence SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, 16,
18, 20, 22, 24, 26, 28, 30 or 32 so that the protein or the part
thereof retains a desaturase or elongase activity. Preferably,
the protein or the part thereof which is encoded by the nucleic
acid molecule has its essential enzymatic activity and the
capability of being implicated in the metabolism of compounds
which are required for the synthesis of plant cell membranes or
in the transport of molecules across these membranes.
Advantageously, the protein encoded by the nucleic acid molecules
is at least approximately 50%, preferably at least approximately
60% and more preferably at least approximately 70%, 80% or 90%
and most preferably at least approximately 95%, 96%, 97%, 98%,
99% or more homologous to an amino acid sequence of the sequence
SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30
or 32. Preferably, the protein is a full-length protein which is
essentially homologous in parts to a total amino acid sequence of
SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30
or 32 (which is the result of the open reading frame shown in
SEQ ID NO: 1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29
or 31). For the purposes of the invention, homology and
homologous are understood as meaning identity or identical.
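
Since homology is defined here as identity, the percentages above
can be illustrated by a simple identity count over two sequences
that have already been aligned. The application does not state
which alignment program or which denominator is to be used; the
convention below (identical residues divided by aligned columns,
with "-" as the gap symbol) and the function name are assumptions
made for illustration.

```python
def percent_identity(aligned_a, aligned_b):
    """Percent identity over the columns of two pre-aligned sequences."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    # Columns in which both sequences carry a gap are ignored.
    columns = [(a, b) for a, b in zip(aligned_a, aligned_b)
               if not (a == "-" and b == "-")]
    if not columns:
        raise ValueError("no aligned columns")
    # A column counts as identical only if both residues match
    # and neither is a gap.
    matches = sum(1 for a, b in columns if a == b and a != "-")
    return 100.0 * matches / len(columns)
```

The result can be compared with the thresholds named above, for
example percent_identity(x, y) >= 50.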

The term essential enzymatic activity of the desaturases and the
elongase used is understood as meaning that, in comparison with
the proteins/enzymes encoded by the sequences with SEQ ID NO: 1,
3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29 or 31, they
retain an enzymatic activity of at least 10%, preferably 20%,
especially preferably 30% and very especially 40% and can thus be
implicated in the metabolism of compounds which are required for
the synthesis of fatty acids in a plant cell or in the transport
of molecules across membranes, meaning desaturated C18- or
C20-22-carbon chains with double bonds at at least two,
advantageously three, four or five positions.
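
The activity thresholds above can be expressed as a relative
activity with graded levels. The sketch is hypothetical: the
application does not prescribe how the activities are measured,
and the function name and the level labels are assumptions.

```python
def activity_level(variant_activity, reference_activity):
    """Grade the retained activity relative to the reference enzyme."""
    if reference_activity <= 0:
        raise ValueError("reference activity must be positive")
    relative = 100.0 * variant_activity / reference_activity
    # Thresholds of 10%, 20%, 30% and 40% as stated in the text.
    levels = ((40, "very especially preferred"),
              (30, "especially preferred"),
              (20, "preferred"),
              (10, "essential activity retained"))
    for threshold, label in levels:
        if relative >= threshold:
            return label
    return None  # below 10%: essential activity not retained
```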

Nucleic acids which can advantageously be used in the process
originate from fungi or plants such as algae or mosses of the
genera Physcomitrella, Thraustochytrium, Phytophthora, Ceratodon,
Isochrysis, Aleurita, Muscarioides, Mortierella, Borago,
Phaeodactylum, Crypthecodinium or from nematodes such as
Caenorhabditis, specifically from the genera and species
Physcomitrella patens, Phytophthora infestans, Ceratodon
purpureus, Isochrysis galbana, Aleurita farinosa, Muscarioides
viallii, Mortierella alpina, Borago officinalis, Phaeodactylum
tricornutum or Caenorhabditis elegans.

As an alternative, the isolated nucleotide sequences used can
encode desaturases or elongases which hybridize, for example
under stringent conditions, with a nucleotide sequence of the SEQ
ID NO: 1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29 or
31.

The nucleic acid sequences used in the process are advantageously
introduced in an expression cassette which makes possible the
expression of the nucleic acids in plants.

Advantageous expression cassettes are shown in SEQ ID NO: 33 to
37. Here, the nucleic acid sequences encoding the desaturases
and/or the elongases are linked operably with one or more
regulatory signals, advantageously for enhancing gene expression.
These regulatory sequences are intended to make possible the
specific expression of the genes and the protein expression.
Depending on the host organism, this may mean, for example, that
the gene is expressed and/or overexpressed only after induction,
or else that it is immediately expressed and/or overexpressed.
For example, these regulatory sequences take the form of
sequences to which inductors or repressors bind and thus regulate
expression of the nucleic acid. In addition to these novel
regulatory sequences, or instead of these sequences, the natural
regulation of these sequences before the actual structural genes
may still be present and, if appropriate, may have been
genetically modified so that the natural regulation has been
switched off and the expression of the genes enhanced. However,
the expression cassette (= expression construct = gene construct)
can also be simpler in construction, that is to say no additional
regulatory signals have been inserted before the nucleic acid
sequence or its derivatives, and the natural promoter together
with its regulation has not been removed. Instead, the natural
regulatory sequence has been mutated in such a way that
regulation no longer takes place and/or gene expression is
enhanced. These modified promoters can also be placed before the
natural gene alone in the form of part-sequences (= promoter
together with parts of the nucleic acid sequences according to
the invention) to enhance the activity. Moreover, the gene
construct can advantageously also comprise one or more enhancer
sequences in operable linkage with the promoter, which make
possible an enhanced expression of the nucleic acid sequence.
Also, additional advantageous sequences, such as further
regulatory elements or terminators, may be inserted at the 3'
terminus of the DNA sequences.
The Δ5-desaturase/Δ6-desaturase and/or Δ6-elongase genes may be
present in the expression cassette (= gene construct) in one or
more copies. Advantageously, in each case only one copy of the
genes is present in the expression cassette. This gene construct,
or the gene constructs, can be expressed together in the host
organism. In this context, the gene construct(s) can be inserted
in one or more vectors and be present in the cell in free form or
else be inserted in the genome. It is advantageous for the
insertion of further genes in the host genome when the genes to
be expressed are present together in one gene construct.

In this context, the regulatory sequences or factors can, as
described above, preferably have a positive effect on the gene
expression of the genes which have been introduced, thus
enhancing it. Thus, the regulatory elements can advantageously be
enhanced at transcriptional level by using strong transcription
signals such as promoters and/or enhancers. In addition, however,
an enhancement of translation is also possible, for example by
improving the stability of the mRNA.

A further embodiment of the invention is one or more gene
constructs which comprise one or more sequences which are defined
by SEQ ID NO: 1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27,
29 or 31 and which encode polypeptides of SEQ ID NO: 2, 4, 6, 8,
10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30 or 32. The
abovementioned desaturases introduce a double bond into the Δ5 or
Δ6 position, the substrate having one, two, three or four double
bonds. Elongase (Δ6-elongase) has an enzyme activity which
elongates a fatty acid by at least two carbon atoms. The same
applies to its homologs, derivatives or analogs which are linked
operably with one or more regulatory signals, advantageously for
enhancing gene expression.
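
The relationship between the two lists of sequence numbers can be
written down compactly. The sketch assumes the usual
sequence-listing convention that each odd-numbered nucleic acid
sequence encodes the polypeptide with the next even number; this
one-to-one pairing is an assumption made here, and the names are
hypothetical.

```python
# Nucleic acid sequences: SEQ ID NO 1, 3, ..., 31.
NUCLEIC_ACID_SEQ_IDS = list(range(1, 32, 2))


def polypeptide_seq_id(nucleic_acid_seq_id):
    """Return the SEQ ID NO of the polypeptide assumed to be encoded."""
    if nucleic_acid_seq_id not in NUCLEIC_ACID_SEQ_IDS:
        raise ValueError("not a listed nucleic acid SEQ ID NO")
    # Assumed pairing: SEQ ID NO n encodes SEQ ID NO n + 1.
    return nucleic_acid_seq_id + 1
```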

Advantageous regulatory sequences for the novel process are
present, for example, in promoters such as cos, tac, trp, tet,
trp-tet, lpp, lac, lpp-lac, lacIq, T7, T5, T3, gal, trc, ara, SP6,
λ-PR or λ-PL promoter and are advantageously used in Gram-negative
bacteria. Further advantageous regulatory sequences are present,
for example, in the Gram-positive promoters amy and SPO2, in the
yeast or fungal promoters ADC1, MFα, AC, P-60, CYC1, GAPDH, TEF,
rp28, ADH or in the plant promoters CaMV/35S [Franck et al., Cell
21 (1980) 285-294], PRP1 [Ward et al., Plant. Mol. Biol. 22
(1993)], SSU, OCS, lib4, usp, STLS1, B33, nos or in the ubiquitin
or phaseolin promoter. Also advantageous in this connection are
inducible promoters such as the promoters described in EP-A-0
388 186 (benzylsulfonamide-inducible), Plant J. 2, 1992:397-404
(Gatz et al., tetracyclin-inducible), EP-A-0 335 528 (abscisic
acid-inducible) or WO 93/21334 (ethanol- or
cyclohexenol-inducible). Further useful plant promoters are the
potato cytosolic FBPase promoter or ST-LSI promoter (Stockhaus et
al., EMBO J. 8, 1989, 2445), the Glycine max
phosphoribosyl-pyrophosphate amidotransferase promoter (Genbank
Accession No. U87999) or the node-specific promoter described in
EP-A-0 249 676. Especially advantageous promoters are promoters
which make possible expression in tissues which are implicated in
fatty acid biosynthesis. Very especially advantageous are
seed-specific promoters, such as the USP promoter in accordance
with the specification, but also other promoters such as the
LeB4, DC3, phaseolin or napin promoter. Further especially
advantageous promoters are seed-specific promoters which can be
used for monocots or dicots and which are described in
US 5,608,152 (oilseed rape napin promoter), WO 98/45461
(Arabidopsis oleosin promoter), US 5,504,200 (Phaseolus vulgaris
phaseolin promoter), WO 91/13980 (Brassica Bce4 promoter) and by
Baeumlein et al., Plant J., 2, 2, 1992:233-239 (LeB4 promoter
from a legume), said promoters being useful in dicots. The
following promoters are suitable for example in monocots: barley
lpt-2 or lpt-1 promoter (WO 95/15389 and WO 95/23230), barley
hordein promoter and other suitable promoters which are described
in WO 99/16890.
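
The assignment of seed-specific promoters to dicots and monocots
given above can be kept in a small lookup table. The sketch lists
only promoters named in the preceding paragraph; the dictionary
layout and the function name are assumptions made for
illustration.

```python
# Seed-specific promoters named in the text, grouped by plant class.
SEED_SPECIFIC_PROMOTERS = {
    "dicot": ["napin", "oleosin", "phaseolin", "Bce4", "LeB4"],
    "monocot": ["lpt-2", "lpt-1", "hordein"],
}


def candidate_promoters(plant_class):
    """Return the listed seed-specific promoters for a plant class."""
    try:
        # Return a copy so that callers cannot alter the table.
        return list(SEED_SPECIFIC_PROMOTERS[plant_class])
    except KeyError:
        raise ValueError(f"unknown plant class: {plant_class!r}") from None
```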

In principle, it is possible to use all natural promoters with
their regulatory sequences like those mentioned above for the
novel process. It is likewise possible and advantageous to use
synthetic promoters, in addition or alone, especially when they
confer seed-specific expression, such as, for example, described
in WO 99/16890.

In order to achieve a particularly high PUFA content in
transgenic plants, the PUFA biosynthetic genes should
advantageously be expressed in oil crops in a seed-specific
manner. To this end, seed-specific promoters can be used, or
those promoters which are active in the embryo and/or in the
endosperm. In principle, seed-specific promoters can be isolated
from both dicots and monocots. Advantageous preferred promoters
are detailed hereinbelow: USP (= unknown seed protein) and
vicilin (Vicia faba) [Baumlein et al., Mol. Gen. Genet., 1991,
225(3)], napin (oilseed rape) [US 5,608,152], Acyl-Carrier
Protein (oilseed rape) [US 5,315,001 and WO 92/18634], oleosin
(Arabidopsis thaliana) [WO 98/45461 and WO 93/20216], phaseolin
(Phaseolus vulgaris) [US 5,504,200], Bce4 [WO 91/13980], legume
B4 (LegB4 promoter) [Baumlein et al., Plant J., 2, 2, 1992], Lpt2
and lpt1 (barley) [WO 95/15389 and WO 95/23230], seed-specific
promoters from rice, maize and wheat [WO 99/16890], Amy32b, Amy
6-6 and aleurain [US 5,677,474], Bce4 (oilseed rape)
[US 5,530,149], glycinin (soya) [EP 571 741], phosphoenolpyruvate
carboxylase (soya) [JP 06/62870], ADR12-2 (soya) [WO 98/08962],
isocitrate lyase (oilseed rape) [US 5,689,040] or
P-amylase (barley) [EP 781 849]. 

Plant gene expression can also be facilitated via a chemically
inducible promoter (see a review in Gatz 1997, Annu. Rev. Plant
Physiol. Plant Mol. Biol., 48:89-108). Chemically inducible
promoters are particularly suitable when it is desired that gene
expression should take place in a time-specific manner. Examples
of such promoters are a salicylic-acid-inducible promoter (WO
95/19443), a tetracyclin-inducible promoter (Gatz et al. (1992)
Plant J. 2, 397-404) and an ethanol-inducible promoter.

To ensure the stable integration of the biosynthesis genes into
the transgenic plant over a plurality of generations, each of the
nucleic acids which encode Δ6-desaturase, Δ5-desaturase or
Δ6-elongase and which are used in the process should be expressed
under the control of a separate promoter, preferably a promoter
which differs from the other promoters, since repeating sequence
motifs can lead to instability of the T-DNA, or to recombination
events. In this context, the expression cassette is
advantageously constructed in such a way that a promoter is
followed by a suitable cleavage site, advantageously in a
polylinker, for insertion of the nucleic acid to be expressed
and, if appropriate, a terminator sequence is positioned behind
the polylinker. This sequence is repeated several times,
preferably three, four or five times, so that up to five genes
can be combined in one construct and introduced into the
transgenic plant in order to be expressed. Advantageously, the
sequence is repeated up to three times (see sequence listing SEQ
ID NO: 33 to 37). To express the nucleic acid sequences, the
latter are inserted after the promoter via a suitable cleavage
site, for example in the polylinker. Advantageously, each nucleic
acid sequence has its own promoter and, if appropriate, its own
terminator sequence. However, it is also possible to insert a
plurality of nucleic acid sequences after a promoter and, if
appropriate, before a terminator sequence. Here, the insertion
site, or the order, of the inserted nucleic acids in the
expression cassette is not of critical importance, that is to say
a nucleic acid sequence can be inserted at the first or last
position in the cassette without its expression being
substantially influenced thereby. Advantageously, different
promoters such as, for example, the USP, LegB4 or DC3 promoter,
and different terminator sequences can be used in the expression
cassette. However, it is also possible to use only one type of
promoter in the cassette, which, however, may lead to undesired
recombination events.

As described above, the transcription of the genes which have
been introduced should advantageously be terminated by suitable
terminator sequences at the 3' end of the biosynthetic genes
which have been introduced (after the stop codon). An example of
a sequence which can be used in this context is the OCS1
terminator sequence. As is the case with the promoters, different
terminator sequences should be used for each gene.
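
The design rules for the expression cassette (a promoter, the
inserted gene and a terminator per unit, up to five units, and
different promoters and terminators for each gene) can be checked
with a short routine. This is a sketch under stated assumptions:
the class, the function and the terminator names T1, T2 and T3 are
hypothetical placeholders, and the check only counts repeated
names; it does not analyze sequences.

```python
from collections import Counter
from dataclasses import dataclass

MAX_GENES_PER_CONSTRUCT = 5  # "up to five genes" according to the text


@dataclass(frozen=True)
class CassetteUnit:
    promoter: str
    gene: str
    terminator: str


def design_warnings(units):
    """List violations of the design rules described above."""
    warnings = []
    if len(units) > MAX_GENES_PER_CONSTRUCT:
        warnings.append("more than five genes in one construct")
    # Repeated promoters or terminators are repeated sequence motifs,
    # which the text associates with undesired recombination events.
    for kind in ("promoter", "terminator"):
        counts = Counter(getattr(unit, kind) for unit in units)
        for name, count in sorted(counts.items()):
            if count > 1:
                warnings.append(f"{kind} {name} used {count} times")
    return warnings


# Hypothetical construct with three different promoters and terminators.
example_construct = [
    CassetteUnit("USP", "delta6-desaturase", "T1"),
    CassetteUnit("LegB4", "delta6-elongase", "T2"),
    CassetteUnit("DC3", "delta5-desaturase", "T3"),
]
```

For example_construct the routine returns an empty list, whereas
a construct which uses the USP promoter twice is reported.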

As described above, the gene construct can also comprise further
genes to be introduced into the organisms. It is possible and
advantageous to introduce into the host organisms, and to
express, regulatory genes such as genes for inductors, repressors
or enzymes which, owing to their enzyme activity, engage in the
regulation of one or more genes of a biosynthesis pathway. These
genes can be of heterologous or of homologous origin. Moreover,
further biosynthesis genes of the fatty acid or lipid metabolism
can advantageously be present in the nucleic acid construct, or
gene construct; however, these genes can also be present on one
or more further nucleic acid constructs. A biosynthetic gene of
the fatty acid or lipid metabolism which is preferably chosen is
a gene selected from the group acyl-CoA dehydrogenase(s),
acyl-ACP [= acyl carrier protein] desaturase(s), acyl-ACP
thioesterase(s), fatty acid acyltransferase(s), fatty acid
synthase(s), fatty acid hydroxylase(s), acetyl-coenzyme A
carboxylase(s), acyl-coenzyme A oxidase(s), fatty acid
desaturase(s), fatty acid acetylenases, lipoxygenases,
triacylglycerol lipases, allene oxide synthases, hydroperoxide
lyases or fatty acid elongase(s) or their combinations.

In this context, the abovementioned desaturases can be cloned
into expression cassettes according to the invention in
combination with elongases and other desaturases and employed for
the transformation of plants with the aid of Agrobacterium.

In this context, the regulatory sequences or factors can, as
described above, have a positive effect on, preferably, the gene
expression of the genes introduced, thus enhancing it. Thus,
enhancement of the regulatory elements can advantageously take
place at the transcriptional level by using strong transcription
signals such as promoters and/or enhancers. In addition, however,
enhancement of translation is also possible, for example by
improving the stability of the mRNA. In principle, the expression
cassettes can be used directly for introduction into the plant,
or else be introduced into a vector.

These advantageous vectors, preferably expression vectors,
comprise the nucleic acids which are used in the method and which
encode Δ5- or Δ6-desaturases or Δ6-elongases, or a nucleic acid
construct which comprises the nucleic acid used, alone or in
combination with further biosynthetic genes of the fatty acid or
lipid metabolism. As used in the present context, the term
"vector" refers to a nucleic acid molecule which is capable of
transporting another nucleic acid, to which it is bound. One type
of vector is a "plasmid", which represents a circular
double-stranded DNA loop into which additional DNA segments can be
ligated. Another type of vector is a viral vector, it being
possible for additional DNA segments to be ligated in the viral
genome. Certain vectors are capable of autonomous replication
in a host cell in which they have been introduced (for example
bacterial vectors with bacterial origin of replication). Other
vectors are advantageously integrated in the genome of a host
cell when being introduced into the host cell, whereby they
replicate together with the host genome. Moreover, certain
vectors are capable of governing the expression of genes with
which they are operably linked. These vectors are referred to
herein as "expression vectors". Usually, expression vectors which
are suitable for DNA recombination techniques take the form of
plasmids. In the present description, "plasmid" and "vector" can
be used interchangeably since the plasmid is the most frequently
used vector form. However, the invention is also intended to
comprise these other forms of expression vectors, such as viral
vectors, which have similar functions. Furthermore, the term
vector is also intended to comprise other vectors which are known
to the skilled worker, such as phages, viruses such as SV40, CMV,
TMV, transposons, IS elements, phasmids, phagemids, cosmids,
linear or circular DNA.

The recombinant expression vectors which are advantageously used
in the method comprise the nucleic acids described hereinbelow or
the above-described gene construct in a form suitable for
expressing these nucleic acids in a host cell, which means that
the recombinant expression vectors comprise one or more
regulatory sequences selected on the basis of the host cells to
be used for the expression, which are linked operably with the
nucleic acid sequence to be expressed. "Linked operably" in a
recombinant expression vector means that the nucleotide sequence
of interest is bound to the regulatory sequence(s) in such a way
that the expression of the nucleotide sequence is possible and
that they are bound with one another so that both sequences
fulfill the predicted function ascribed to the sequence (for
example in an in-vitro transcription/translation system or in a
host cell if the vector is introduced into the host cell). The
term "regulatory sequence" is intended to comprise promoters,
enhancers and other expression control elements (for example
polyadenylation signals). These regulatory sequences are
described for example in Goeddel: Gene Expression Technology:
Methods in Enzymology 185, Academic Press, San Diego, CA (1990),
or see: Gruber and Crosby, in: Methods in Plant Molecular Biology
and Biotechnology, CRC Press, Boca Raton, Florida, eds.: Glick and
Thompson, chapter 7, 89-108, including the references therein.
Regulatory sequences comprise those which govern the constitutive
expression of a nucleotide sequence in many types of host cell
and those which govern the direct expression of the nucleotide
sequence only in specific host cells under specific conditions.
The skilled worker knows that the design of the expression vector
can depend on factors such as the choice of the host cell to be
transformed, the expression level of the desired protein and the
like.

The recombinant expression vectors used can be designed for
expressing desaturases and elongases in prokaryotic or eukaryotic
cells. This is advantageous since intermediate steps of vector
construction are frequently performed in microorganisms for the
sake of simplicity. For example, desaturase and/or elongase genes
can be expressed in bacterial cells, insect cells (using
baculovirus expression vectors), yeast cells and other fungal
cells (see Romanos, M.A., et al. (1992) "Foreign gene expression
in yeast: a review", Yeast 8:423-488; van den Hondel, C.A.M.J.J.,
et al. (1991) "Heterologous gene expression in filamentous
fungi", in: More Gene Manipulations in Fungi, J.W. Bennet & L.L.
Lasure, ed., pp. 396-428: Academic Press: San Diego; and van den
Hondel, C.A.M.J.J., & Punt, P.J. (1991) "Gene transfer systems
and vector development for filamentous fungi", in: Applied
Molecular Genetics of Fungi, Peberdy, J.F., et al., ed., pp.
1-28, Cambridge University Press: Cambridge), algae (Falciatore
et al., 1999, Marine Biotechnology 1, 3:239-251), ciliates of
the types: Holotrichia, Peritrichia, Spirotrichia, Suctoria,
Tetrahymena, Paramecium, Colpidium, Glaucoma, Platyophrya,
Potomacus, Pseudocohnilembus, Euplotes, Engelmaniella and
Stylonychia, in particular the species Stylonychia lemnae, using
vectors by a transformation method as described in WO 98/01572,
and preferably in cells of multi-celled plants (see Schmidt, R.
and Willmitzer, L. (1988) "High efficiency Agrobacterium
tumefaciens-mediated transformation of Arabidopsis thaliana leaf
and cotyledon explants" Plant Cell Rep.:583-586; Plant Molecular
Biology and Biotechnology, CRC Press, Boca Raton, Florida, chapter
6/7, pp. 71-119 (1993); F.F. White, B. Jenes et al., Techniques
for Gene Transfer, in: Transgenic Plants, vol. 1, Engineering and
Utilization, ed.: Kung and R. Wu, Academic Press (1993), 128-43;
Potrykus, Annu. Rev. Plant Physiol. Plant Molec. Biol. 42 (1991),
205-225 (and references cited therein)). Suitable host cells are
furthermore discussed in Goeddel, Gene Expression Technology:
Methods in Enzymology 185, Academic Press, San Diego, CA (1990).
As an alternative, the recombinant expression vector can be
transcribed and translated in vitro, for example using T7
promoter regulation sequences and T7 polymerase.

Protein expression in prokaryotes is usually performed with the
aid of vectors which comprise constitutive or inducible promoters
which govern the expression of fusion proteins or nonfusion
proteins. Typical fusion expression vectors are, inter alia, pGEX
(Pharmacia Biotech Inc; Smith, D.B., and Johnson, K.S. (1988)
Gene 67:31-40), pMAL (New England Biolabs, Beverly, MA) and pRIT5
(Pharmacia, Piscataway, NJ), where glutathione S-transferase
(GST), maltose-E-binding protein or protein A, respectively, is
fused with the recombinant target protein.

Examples of suitable inducible nonfusion E. coli expression
vectors are, inter alia, pTrc (Amann et al. (1988) Gene
69:301-315) and pET 11d (Studier et al., Gene Expression
Technology: Methods in Enzymology 185, Academic Press, San Diego,
California (1990) 60-89). The target gene expression of the pTrc
vector is based on transcription by host RNA polymerase from a
hybrid trp-lac fusion promoter. The target gene expression from
the pET 11d vector is based on the transcription of a T7-gn10-lac
fusion promoter, which is mediated by a coexpressed viral RNA
polymerase (T7 gn1). This viral polymerase is provided by the
host strains BL21 (DE3) or HMS174 (DE3) by a resident λ prophage
which harbors a T7 gn1 gene under the transcriptional control of
the lacUV 5 promoter.

Other vectors which are suitable for use in prokaryotic organisms
are known to the skilled worker; these vectors are, for example
in E. coli, pLG338, pACYC184, the pBR series such as pBR322, the
pUC series such as pUC18 or pUC19, the M113mp series, pKC30,
pRep4, pHS1, pHS2, pPLc236, pMBL24, pLG200, pUR290, pIN-III113-B1,
λgt11 or pBdCI, in Streptomyces pIJ101, pIJ364, pIJ702 or pIJ361,
in Bacillus pUB110, pC194 or pBD214, in Corynebacterium pSA77
or pAJ667.

In a further embodiment, the expression vector is a yeast
expression vector. Examples of vectors for expression in the
yeast S. cerevisiae comprise pYepSec1 (Baldari et al.
(1987) Embo J. 6:229-234), pMFa (Kurjan and Herskowitz (1982)
Cell 30:933-943), pJRY88 (Schultz et al. (1987) Gene 54:113-123)
and pYES2 (Invitrogen Corporation, San Diego, CA). Vectors and
methods for construction of vectors which are suitable for use in
other fungi, such as the filamentous fungi, comprise those which
are described in detail in: van den Hondel, C.A.M.J.J., & Punt,
P.J. (1991) "Gene transfer systems and vector development for
filamentous fungi", in: Applied Molecular Genetics of Fungi, J.F.
Peberdy et al., ed., pp. 1-28, Cambridge University Press:
Cambridge, or in: More Gene Manipulations in Fungi [J.W. Bennet &
L.L. Lasure, ed., pp. 396-428: Academic Press: San Diego].
Further suitable yeast vectors are, for example, pAG-1, YEp6,
YEp13 or pEMBLYe23.

As an alternative, the desaturases and/or elongases can be
expressed in insect cells using baculovirus expression vectors.
Baculovirus vectors which are available for expressing proteins
in cultured insect cells (for example Sf9 cells) comprise the pAc
series (Smith et al. (1983) Mol. Cell Biol. 3:2156-2165) and the
pVL series (Lucklow and Summers (1989) Virology 170:31-39).

The abovementioned vectors offer only a small overview of
suitable vectors which are possible. Further plasmids are known
to the skilled worker and are described, for example, in: Cloning
Vectors (ed. Pouwels, P.H., et al., Elsevier,
Amsterdam-New York-Oxford, 1985, ISBN 0 444 904018). For further
suitable expression systems for prokaryotic and eukaryotic cells,
see chapters 16 and 17 of Sambrook, J., Fritsch, E.F., and
Maniatis, T., Molecular Cloning: A Laboratory Manual, 2nd
edition, Cold Spring Harbor Laboratory, Cold Spring Harbor
Laboratory Press, Cold Spring Harbor, NY, 1989.

In a further embodiment of the process, the desaturases and/or
elongases can be expressed in single-cell plant cells (such as
algae), see Falciatore et al., 1999, Marine Biotechnology 1
(3):239-251 and references cited therein, and plant cells from
higher plants (for example spermatophytes such as crops).
Examples of plant expression vectors comprise those which are
described in detail in: Becker, D., Kemper, E., Schell, J., and
Masterson, R. (1992) "New plant binary vectors with selectable
markers located proximal to the left border", Plant Mol. Biol.
20:1195-1197; and Bevan, M.W. (1984) "Binary Agrobacterium
vectors for plant transformation", Nucl. Acids Res. 12:8711-8721;
Vectors for Gene Transfer in Higher Plants; in: Transgenic
Plants, vol. 1, Engineering and Utilization, ed.: Kung and R. Wu,
Academic Press, 1993, pp. 15-38.

A plant expression cassette preferably comprises regulatory
sequences which are capable of governing the gene expression in
plant cells and which are linked operably so that each sequence
can fulfill its function, such as transcriptional termination,
for example polyadenylation signals. Preferred polyadenylation
signals are those which originate from Agrobacterium tumefaciens
T-DNA, such as the gene 3 of the Ti plasmid pTiACH5, which is
known as octopine synthase (Gielen et al., EMBO J. 3 (1984)
835ff.) or functional equivalents thereof, but all other
terminators which are functionally active in plants are also
suitable.

Since plant gene expression is very often not limited to the
transcriptional level, a plant expression cassette preferably
comprises other operably linked sequences such as translation
enhancers, for example the overdrive sequence which comprises the
5'-untranslated leader sequence from tobacco mosaic virus, which
increases the protein/RNA ratio (Gallie et al., 1987, Nucl. Acids
Research 15:8693-8711).

As described above, plant gene expression must be linked operably
with a suitable promoter which performs gene expression with the
correct timing or in a cell- or tissue-specific manner.
Utilizable promoters are constitutive promoters (Benfey et al.,
EMBO J. 8 (1989) 2195-2202) such as those which are derived from
plant viruses, such as 35S CaMV (Franck et al., Cell 21 (1980)
285-294), 19S CaMV (see also US 5352605 and WO 84/02913) or plant
promoters such as the Rubisco small subunit, which is described
in US 4,962,028.

Other sequences which are preferred for operable linkage in plant
gene expression cassettes are targeting sequences, which are
required for targeting the gene product into its relevant cell
compartment (for a review see Kermode, Crit. Rev. Plant Sci. 15,
4 (1996) 285-423 and references cited therein), for example into
the vacuole, the nucleus, all types of plastids, such as
amyloplasts, chloroplasts, chromoplasts, the extracellular space,
the mitochondria, the endoplasmic reticulum, oil bodies,
peroxisomes and other plant cell compartments.

Plant gene expression can also be facilitated as described above
via a chemically inducible promoter (for a review see Gatz 1997,
Annu. Rev. Plant Physiol. Plant Mol. Biol., 48:89-108).
Chemically inducible promoters are suitable in particular when it
is desired that gene expression takes place in a time-specific
manner. Examples of such promoters are a salicylic acid-inducible
promoter (WO 95/19443), a tetracyclin-inducible promoter (Gatz et
al. (1992) Plant J. 2, 397-404) and an ethanol-inducible promoter.

Other promoters which are suitable are promoters which respond to
biotic or abiotic stress conditions, for example the
pathogen-induced PRP1 gene promoter (Ward et al., Plant Mol. Biol.
22 (1993) 361-366), the heat-inducible tomato hsp80 promoter (US
5,187,267), the chill-inducible potato alpha-amylase promoter (WO
96/12814) or the wound-inducible pinII promoter (EP-A-0 375 091).

Preferred promoters are in particular those which bring about the
expression of genes in tissues and organs in which lipid and oil
biosynthesis takes place, in seed cells, such as cells of the
endosperm and of the developing embryo. Suitable promoters are
the oilseed rape napin gene promoter (US 5,608,152), the Vicia
faba USP promoter (Baeumlein et al., Mol Gen Genet, 1991, 225
(3):459-67), the Arabidopsis oleosin promoter (WO 98/45461), the
Phaseolus vulgaris phaseolin promoter (US 5,504,200), the
Brassica Bce4 promoter (WO 91/13980) or the legumin B4 promoter
(LeB4; Baeumlein et al., 1992, Plant Journal, 2 (2):233-9), and
promoters which bring about the seed-specific expression in
monocots such as maize, barley, wheat, rye, rice and the like.
Suitable promoters which should be taken into consideration are
the barley lpt2 or lpt1 gene promoter (WO 95/15389 and WO
95/23230), or those described in WO 99/16890 (promoters from the
barley hordein gene, the rice glutelin gene, the rice oryzin
gene, the rice prolamin gene, the wheat gliadin gene, the wheat
glutelin gene, the maize zein gene, the oat glutelin gene, the
sorghum kasirin gene, the rye secalin gene).

In particular, it may be desired to bring about the multiparallel
expression of the desaturases and/or elongases used in the method
alone or in combination with other desaturases or elongases. Such
expression cassettes can be introduced via the simultaneous
transformation of a plurality of individual expression constructs
or, preferably, by combining a plurality of expression cassettes
on one construct. Also, it is possible to transform a plurality
of vectors with in each case a plurality of expression cassettes
and to transfer them to the host cell.
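
The combination of several expression cassettes on one construct can
be pictured with a minimal data-structure sketch. The class names, the
construct name and the pairing of promoter, coding sequence and
terminator are hypothetical and serve only as an illustration; they do
not describe a construct disclosed here.

```python
from dataclasses import dataclass, field


@dataclass(frozen=True)
class ExpressionCassette:
    """One operably linked unit: promoter, coding sequence, terminator."""
    promoter: str
    coding_sequence: str
    terminator: str


@dataclass
class Construct:
    """A construct that carries any number of expression cassettes."""
    name: str
    cassettes: list = field(default_factory=list)

    def add(self, cassette: ExpressionCassette) -> "Construct":
        self.cassettes.append(cassette)
        return self

    def encoded_activities(self) -> list:
        """List the enzyme activities encoded on this construct, in order."""
        return [c.coding_sequence for c in self.cassettes]


def build_example_construct() -> Construct:
    # Hypothetical example: three cassettes on a single construct.
    construct = Construct("example-construct")
    for activity in ("delta-6 desaturase", "delta-6 elongase",
                     "delta-5 desaturase"):
        construct.add(ExpressionCassette(
            promoter="seed-specific promoter",       # assumed choice
            coding_sequence=activity,
            terminator="octopine synthase terminator",  # assumed choice
        ))
    return construct


if __name__ == "__main__":
    print(build_example_construct().encoded_activities())
```

Introducing a plurality of individual constructs simultaneously would
correspond to several Construct objects with one cassette each.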

Promoters which are likewise especially suitable are those which
bring about the plastid-specific expression since plastids are
the compartment in which the precursors and some end products of
lipid biosynthesis are synthesized. Suitable promoters such as
the viral RNA polymerase promoter are described in WO 95/16783
and WO 97/06250, and the Arabidopsis clpP promoter, described in
WO 99/46394.

Vector DNA can be introduced into prokaryotic or eukaryotic cells
via conventional transformation or transfection techniques. The
terms "transformation" and "transfection", conjugation and
transduction, as used in the present context, are meant to
comprise a multiplicity of methods known in the art for
introducing foreign nucleic acid (for example DNA) into a host
cell, including calcium phosphate or calcium chloride
coprecipitation, DEAE-dextran-mediated transfection, lipofection,
natural competence, chemically mediated transfer,
electroporation or particle bombardment. Suitable methods for the
transformation or transfection of host cells, including plant
cells, can be found in Sambrook et al. (Molecular Cloning: A
Laboratory Manual, 2nd edition, Cold Spring Harbor Laboratory,
Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY,
1989) and other laboratory handbooks such as Methods in Molecular
Biology, 1995, vol. 44, Agrobacterium protocols, ed.: Gartland
and Davey, Humana Press, Totowa, New Jersey.

Host cells which are suitable in principle for taking up the
nucleic acid according to the invention, the gene product
according to the invention or the vector according to the
invention are all prokaryotic or eukaryotic organisms. The host
organisms which are advantageously used are organisms such as
bacteria, fungi, yeasts or plant cells, preferably plants or
parts thereof. Fungi, yeasts or plants are used by preference;
especially preferably plants, very especially preferably plants
such as oil crops which comprise large amounts of lipid
compounds, such as oilseed rape, evening primrose, hemp, thistle,
peanut, canola, linseed, soya, safflower, sunflower, borage or
plants such as maize, wheat, rye, oats, triticale, rice, barley,
cotton, cassava, pepper, Tagetes, Solanaceae plants such as
potato, tobacco, eggplant and tomato, Vicia species, pea,
alfalfa, bushy plants (coffee, cocoa, tea), Salix species, trees
(oil palm, coconut) and perennial grasses and fodder crops.
Especially preferred plants according to the invention are oil
crops such as soya, peanut, oilseed rape, canola, linseed, hemp,
evening primrose, sunflower, safflower, trees (oil palm,
coconut).

Nucleic acid sequences which are advantageously used in the
process according to the invention are those which encode
polypeptides with a Δ6-desaturase activity, Δ6-elongase activity
or Δ5-desaturase activity, selected from the group consisting of:

a) a nucleic acid sequence with the sequence shown in SEQ ID
   NO: 1, SEQ ID NO: 3, SEQ ID NO: 5, SEQ ID NO: 7, SEQ ID
   NO: 9, SEQ ID NO: 11, SEQ ID NO: 13, SEQ ID NO: 15, SEQ ID
   NO: 17, SEQ ID NO: 19, SEQ ID NO: 21, SEQ ID NO: 23, SEQ ID
   NO: 25, SEQ ID NO: 27, SEQ ID NO: 29 or SEQ ID NO: 31,

b) nucleic acid sequences which, owing to the degeneracy of the
   genetic code, are obtained by back translation of the amino
   acid sequences shown in SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID
   NO: 6, SEQ ID NO: 8, SEQ ID NO: 10, SEQ ID NO: 12, SEQ ID
   NO: 14, SEQ ID NO: 16, SEQ ID NO: 18, SEQ ID NO: 20, SEQ ID
   NO: 22, SEQ ID NO: 24, SEQ ID NO: 26, SEQ ID NO: 28, SEQ ID
   NO: 30 or SEQ ID NO: 32,

c) derivatives of the nucleic acid sequences shown in SEQ ID
   NO: 1, SEQ ID NO: 3, SEQ ID NO: 5, SEQ ID NO: 7, SEQ ID
   NO: 9, SEQ ID NO: 11, SEQ ID NO: 13, SEQ ID NO: 15, SEQ ID
   NO: 17, SEQ ID NO: 19, SEQ ID NO: 21, SEQ ID NO: 23, SEQ ID
   NO: 25, SEQ ID NO: 27, SEQ ID NO: 29 or SEQ ID NO: 31 which
   encode polypeptides with the amino acid sequences shown in
   SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 6, SEQ ID NO: 8, SEQ
   ID NO: 10, SEQ ID NO: 12, SEQ ID NO: 14, SEQ ID NO: 16, SEQ
   ID NO: 18, SEQ ID NO: 20, SEQ ID NO: 22, SEQ ID NO: 24, SEQ
   ID NO: 26, SEQ ID NO: 28, SEQ ID NO: 30 or SEQ ID NO: 32 and
   which have at least 50% homology at the amino acid level,
   without the enzymatic activity of the polypeptides being
   substantially reduced.
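
Why back translation under item b) yields a family of nucleic acid
sequences rather than a single one can be shown with a short sketch.
It assumes the standard genetic code; the function name and the example
peptides are hypothetical and are not taken from the sequence listing.

```python
from math import prod

# Number of synonymous codons per amino acid in the standard genetic code
# (one-letter amino acid codes; stop codons are not included).
CODON_COUNTS = {
    "A": 4, "R": 6, "N": 2, "D": 2, "C": 2, "Q": 2, "E": 2, "G": 4,
    "H": 2, "I": 3, "L": 6, "K": 2, "M": 1, "F": 2, "P": 4, "S": 6,
    "T": 4, "W": 1, "Y": 2, "V": 4,
}


def count_back_translations(peptide: str) -> int:
    """Count the distinct coding sequences that encode the given peptide."""
    return prod(CODON_COUNTS[residue] for residue in peptide.upper())


if __name__ == "__main__":
    # Hypothetical short peptides, used only to show the arithmetic.
    for peptide in ("MW", "ML", "HPGG"):
        print(peptide, count_back_translations(peptide))
```

Because the count is a product over all residues, it grows very
quickly with peptide length, which is why item b) defines the sequences
by reference to the encoded amino acid sequence.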

The abovementioned nucleic acid according to the invention
originates from organisms such as animals, ciliates, fungi,
plants such as algae or dinoflagellates which are capable of
synthesizing PUFAs.

The term "nucleic acid (molecule)" as used in the present context
also comprises the untranslated sequence located at the 3' and at
the 5' end of the coding gene region: at least 500, preferably
200, especially preferably 100 nucleotides of the sequence
upstream of the 5' terminus of the coding region and at least
100, preferably 50, especially preferably 20 nucleotides of the
sequence downstream of the 3' end of the coding gene region. An
"isolated" nucleic acid molecule is separated from other nucleic
acid molecules which are present in the natural source of the
nucleic acid. An "isolated" nucleic acid preferably has no
sequences which naturally flank the nucleic acid in the genomic
DNA of the organism from which the nucleic acid originates (for
example sequences which are present at the 5' and 3' ends of the
nucleic acid). In different embodiments, the isolated desaturase
or elongase nucleic acid molecule may comprise, for example, less
than approximately 5 kb, 4 kb, 3 kb, 2 kb, 1 kb, 0.5 kb or 0.1 kb
of nucleotide sequences which naturally flank the nucleic acid
molecule in the genomic DNA of the cell from which the nucleic
acid originates.

The nucleic acid molecules used in the process, for example a
nucleic acid molecule with a nucleotide sequence of the SEQ ID
NO: 1 or a part thereof, can be isolated using standard
molecular-biological techniques and the sequence information
provided herein. Also, for example, a homologous sequence or
homologous, conserved sequence regions at the DNA or amino acid
level can be identified with the aid of comparative algorithms.
These can be used as hybridization probes, together with standard
hybridization techniques (as described, for example, in Sambrook
et al., Molecular Cloning: A Laboratory Manual, 2nd ed., Cold
Spring Harbor Laboratory, Cold Spring Harbor Laboratory Press,
Cold Spring Harbor, NY, 1989), for the isolation of further
nucleic acid sequences which are useful in the process. Moreover,
a nucleic acid molecule comprising a complete sequence of the SEQ
ID NO: 1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29 or
31 or a part thereof can be isolated by polymerase chain
reaction, using oligonucleotide primers which are generated on the
basis of this sequence or parts thereof (for example, it is
possible to isolate a nucleic acid molecule comprising the
complete sequence or a part thereof by means of polymerase chain
reaction using oligonucleotide primers which have been generated
on the basis of the same sequence). For example, mRNA can be
isolated from cells (for example by means of the guanidinium
thiocyanate extraction method of Chirgwin et al. (1979)
Biochemistry 18:5294-5299) and cDNA can be generated by means of
reverse transcriptase (for example Moloney MLV Reverse
Transcriptase, available from Gibco/BRL, Bethesda, MD, or AMV
Reverse Transcriptase, available from Seikagaku America, Inc.,
St. Petersburg, FL). Synthetic oligonucleotide primers for
amplification by means of polymerase chain reaction can be
generated based on one of the sequences shown in SEQ ID NO: 1, 3,
5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29 or 31 and that of
Figure 5a, or with the aid of the amino acid sequences shown in
SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30
or 32. A nucleic acid according to the invention can be amplified
in accordance with standard PCR amplification techniques using
cDNA or, alternatively, genomic DNA as template and suitable
oligonucleotide primers. The nucleic acid amplified thus can be
cloned into a suitable vector and characterized by means of DNA
sequence analysis. Oligonucleotides which correspond to a
desaturase nucleotide sequence can be generated by means of
standard synthetic methods, for example using an automatic DNA
synthesizer.
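
How oligonucleotide primers can be derived from a known sequence may be
sketched as follows. This is a minimal illustration under stated
assumptions: the function names, the primer length and the short
template are hypothetical, and real primer design would additionally
consider melting temperature, specificity and secondary structure.

```python
# Translation table for complementing DNA bases (upper and lower case).
COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence."""
    return seq.translate(COMPLEMENT)[::-1]


def end_primers(template: str, length: int = 20) -> tuple:
    """Derive a forward and a reverse primer from the ends of a template.

    The forward primer is identical to the first `length` bases of the
    coding strand; the reverse primer is the reverse complement of the
    last `length` bases, so that both primers read 5' to 3'.
    """
    if length <= 0 or len(template) < length:
        raise ValueError("template is shorter than the requested primer length")
    forward = template[:length]
    reverse = reverse_complement(template[-length:])
    return forward, reverse


if __name__ == "__main__":
    # Hypothetical template; not a sequence from the sequence listing.
    template = "ATGAAACCCGGGTTTTAA"
    print(end_primers(template, length=6))
```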

Homologs of the desaturase or elongase nucleic acid sequences
used, with sequence SEQ ID NO: 1, 3, 5, 7, 9, 11, 13, 15, 17, 19,
21, 23, 25, 27, 29 or 31, means for example allelic variants with
at least approximately 50 to 60%, preferably at least
approximately 60 to 70%, more preferably at least approximately
70 to 80%, 80 to 90% or 90 to 95% and even more preferably at
least approximately 95%, 96%, 97%, 98%, 99% or more homology with
one of the nucleotide sequences shown in SEQ ID NO: 1, 3, 5, 7,
9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29 or 31 or their
homologs, derivatives or analogs, or parts of these. Moreover,
homologs are isolated nucleic acid molecules with a nucleotide
sequence which hybridizes with one of the nucleotide sequences
shown in SEQ ID NO: 1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23,
25, 27, 29 or 31 or a part thereof, for example under stringent
conditions. Allelic variants comprise in particular functional
variants which can be obtained by deletion, insertion or
substitution of nucleotides from/into the sequence shown in
SEQ ID NO: 1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27,
29 or 31, it being intended, however, that the enzyme activity of
the resulting synthesized proteins is advantageously retained for
the insertion of one or more genes. Proteins which retain the
enzymatic activity of the desaturase or elongase, i.e. whose
activity is essentially not reduced, are understood as meaning
proteins with at least 10%, preferably 20%, especially preferably
30%, very especially preferably 40% of the original enzyme
activity in comparison with the protein encoded by SEQ ID NO: 2,
4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30 or 32.
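
The percentage thresholds above can be made concrete with a small
sketch. It assumes that the two sequences have already been aligned
(gaps written as "-") and it computes plain percent identity; this
passage does not state which comparison algorithm is used to determine
homology, so the function is an illustrative stand-in with a
hypothetical name, not the method of the description.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent identity of two pre-aligned sequences of equal length.

    Columns in which both sequences carry a gap are ignored; a gap
    opposite a residue counts as a mismatch.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    matches = 0
    for a, b in zip(aligned_a.upper(), aligned_b.upper()):
        if a == "-" and b == "-":
            continue
        compared += 1
        if a == b:
            matches += 1
    if compared == 0:
        raise ValueError("no comparable positions")
    return 100.0 * matches / compared


if __name__ == "__main__":
    # Hypothetical aligned fragments, for illustration only.
    print(percent_identity("ATGCATGCAT", "ATGCTTGC-T"))
```

A candidate sequence would then be compared against the chosen
threshold, for example at least 50% at the amino acid level.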

Homologs of SEQ ID NO: 1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23,
25, 27, 29 or 31 also mean, for example, bacterial, fungal and
plant homologs, truncated sequences, single-stranded DNA or RNA
of the coding and noncoding DNA sequence.

Homologs of SEQ ID NO: 1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23,
25, 27, 29 or 31 also mean derivatives such as, for
example, promoter variants. The promoters upstream of the
abovementioned nucleotide sequences can be modified by one or
more nucleotide substitutions, insertion(s) and/or deletion(s)
without, however, interfering with the functionality or activity
of the promoters. Moreover, it is possible to increase the
activity of the promoters by modifying their sequence or to
replace them completely by more active promoters, including
promoters from heterologous organisms.

The abovementioned nucleic acids and protein molecules with
desaturase or elongase activity which are involved in the
metabolism of lipids and fatty acids, PUFA cofactors and enzymes
or in the transport of lipophilic compounds across membranes are
used in the process according to the invention for the modulation
of the production of compounds of the general formula I in
transgenic plants such as maize, wheat, rye, oats, triticale,
rice, barley, soybean, peanut, cotton, Linum species such as
linseed or flax, Brassica species such as oilseed rape, canola
and turnip rape, pepper, sunflower, borage, evening primrose and
Tagetes, Solanaceae plants such as potato, tobacco, eggplant and
tomato, Vicia species, pea, cassava, alfalfa, bushy plants
(coffee, cocoa, tea), Salix species, trees (oil palm, coconut)
and perennial grasses and fodder crops, either directly (for
example when the overexpression or optimization of a fatty acid
biosynthesis protein has a direct effect on the yield, production
and/or production efficiency of the fatty acid from modified
organisms) and/or they can have an indirect effect which
nevertheless leads to an increase in the yield, production and/or
production efficiency of a desired compound or a decrease in
undesired compounds (for example when the modulation of the
metabolism of lipids and fatty acids, cofactors and enzymes leads
to modifications of the yield, production and/or production
efficiency or the composition of the desired compounds within the
cells, which, in turn, may have an effect on the production of
one or more fatty acids).

The combination of different precursor molecules and biosynthetic
enzymes results in the production of different fatty acid
molecules, which has a decisive effect on lipid composition,
since polyunsaturated fatty acids (= PUFAs) are incorporated not
only into triacylglycerol but also into membrane lipids.

Lipid synthesis can be divided into two sections: the synthesis
of fatty acids and their binding to sn-glycerol-3-phosphate, and
the addition or modification of a polar head group. Conventional
lipids which are used in membranes comprise phospholipids,
glycolipids, sphingolipids and phosphoglycerides. Fatty acid
synthesis starts with the conversion of acetyl-CoA into
malonyl-CoA by the enzyme acetyl-CoA carboxylase or into
acetyl-ACP by the enzyme acetyl transacylase. After a
condensation reaction, these two product molecules together form
acetoacetyl-ACP, which is converted by a series of condensation,
reduction and dehydration reactions so that a saturated fatty
acid molecule with the desired chain length is obtained. The
production of the unsaturated fatty acids from these molecules is
catalyzed by specific desaturases, either aerobically by means of
molecular oxygen or anaerobically (as regards the fatty acid
synthesis in microorganisms, see F.C. Neidhardt et al. (1996) E.
coli and Salmonella. ASM Press: Washington, D.C., pp. 612-636 and
references cited therein; Lengeler et al. (ed.) (1999) Biology of
Procaryotes. Thieme: Stuttgart, New York, and references therein,
and Magnuson, K., et al. (1993) Microbiological Reviews
57:522-542 and the references therein).

Examples of precursors for PUFA biosynthesis are oleic acid,
linoleic acid and linolenic acid. These C18-carbon fatty acids
must be elongated to C20 and C22 to obtain fatty acids of the
eicosa and docosa chain type. With the aid of the enzymes
used in the process, such as Δ5- and Δ6-desaturase and
Δ6-elongase, it is possible to obtain arachidonic acid and
eicosapentaenoic acid and various other long-chain PUFAs, to
extract them and to use them for various purposes in applications
in foodstuffs, feeding stuffs, cosmetics or pharmacology. Using
the abovementioned enzymes, it is possible to produce preferably
C18 + C20 fatty acids with at least two, three, four or five
double bonds in the fatty acid molecule, preferably C20-fatty
acids with advantageously three, four or five double bonds in the
fatty acid molecule. Desaturation can take place before or after
elongation of the fatty acid in question. This is why the
products of desaturase activities and the further desaturation
and elongation which are possible give rise to preferred PUFAs
with a higher degree of desaturation, including a further
elongation from C20 to C22-fatty acids, to give fatty acids such
as γ-linolenic acid, dihomo-γ-linolenic acid, arachidonic acid,
stearidonic acid, eicosatetraenoic acid or eicosapentaenoic acid.
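
The sequence of desaturation and elongation steps described above
can be pictured with a minimal sketch. It assumes the usual route
(Δ6-desaturation, then Δ6-elongation, then Δ5-desaturation) and
writes each fatty acid as (carbon atoms, double bonds, series). The
function and variable names are illustrative only and are not part
of the application.

```python
# Fatty acids as (carbon atoms, double bonds, series).
NAMES = {
    (18, 2, "n-6"): "linoleic acid",
    (18, 3, "n-6"): "gamma-linolenic acid",
    (20, 3, "n-6"): "dihomo-gamma-linolenic acid",
    (20, 4, "n-6"): "arachidonic acid",
    (18, 3, "n-3"): "alpha-linolenic acid",
    (18, 4, "n-3"): "stearidonic acid",
    (20, 4, "n-3"): "eicosatetraenoic acid",
    (20, 5, "n-3"): "eicosapentaenoic acid",
}


def desaturate(fatty_acid):
    """A desaturase adds one double bond; chain length is unchanged."""
    carbons, double_bonds, series = fatty_acid
    return (carbons, double_bonds + 1, series)


def elongate(fatty_acid):
    """An elongase adds two carbon atoms; double bonds are unchanged."""
    carbons, double_bonds, series = fatty_acid
    return (carbons + 2, double_bonds, series)


# Assumed order of the three enzyme activities.
PATHWAY = [
    ("delta-6 desaturase", desaturate),
    ("delta-6 elongase", elongate),
    ("delta-5 desaturase", desaturate),
]


def run_pathway(substrate):
    """Return the names of the substrate and of every intermediate."""
    names = [NAMES[substrate]]
    fatty_acid = substrate
    for _enzyme, reaction in PATHWAY:
        fatty_acid = reaction(fatty_acid)
        names.append(NAMES[fatty_acid])
    return names


if __name__ == "__main__":
    print(" -> ".join(run_pathway((18, 2, "n-6"))))
    print(" -> ".join(run_pathway((18, 3, "n-3"))))
```

Starting from linoleic acid the sketch ends at arachidonic acid, and
starting from α-linolenic acid it ends at eicosapentaenoic acid.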

Substrates in the process according to the invention are, for
example, linoleic acid, γ-linolenic acid, α-linolenic acid,
dihomo-γ-linolenic acid, eicosatetraenoic acid or stearidonic
acid. Preferred substrates are linoleic acid, γ-linolenic acid
and/or α-linolenic acid, dihomo-γ-linolenic acid or arachidonic
acid, eicosatetraenoic acid or eicosapentaenoic acid,
respectively. The C18- or C20-fatty acids with at least two double
bonds in the fatty acid are obtained in the process according to
the invention in the form of the free fatty acid or in the form
of its esters (see formula I), for example in the form of its
glycerides.

The term "glyceride" is understood as meaning a glycerol which is
esterified with one, two or three carboxylic acid residues
(mono-, di- or triglyceride). "Glyceride" is also understood as
being a mixture of various glycerides. The glyceride, or
glyceride mixture, may comprise further additions, for example
free fatty acids, antioxidants, proteins, carbohydrates, vitamins
and/or other substances.

A "glyceride" for the purposes of the process according to the
invention is furthermore understood as meaning glycerol-derived
derivatives. These include, in addition to the above-described
fatty acid glycerides, glycerophospholipids and
glyceroglycolipids. Preferred examples which may be mentioned in
this context are the glycerophospholipids such as lecithin
(phosphatidylcholine), cardiolipin, phosphatidylglycerol,
phosphatidylserine and alkylacylglycerophospholipids.

Furthermore, fatty acids must subsequently be translocated to
various sites of modification and incorporated into the
triacylglycerol storage lipid. A further important step in lipid
synthesis is the transfer of fatty acids onto the polar head
groups, for example by the enzyme glycerol fatty acid
acyltransferase (see Frentzen, 1998, Lipid, 100(4-5):161-166).

For publications on plant fatty acid biosynthesis, desaturation,
lipid metabolism and membrane transport of lipidic compounds,
beta-oxidation, fatty acid modification and cofactors,
triacylglycerol storage and triacylglycerol assembly, including
the references cited therein, see the following papers: Kinney,
1997, Genetic Engineering, ed.: JK Setlow, 19:149-166; Ohlrogge
and Browse, 1995, Plant Cell 7:957-970; Shanklin and Cahoon,
1998, Annu. Rev. Plant Physiol. Plant Mol. Biol. 49:611-641;
Voelker, 1996, Genetic Engineering, ed.: JK Setlow, 18:111-13;
Gerhardt, 1992, Prog. Lipid Res. 31:397-417; Gühnemann-Schäfer &
Kindl, 1995, Biochim. Biophys Acta 1256:181-186; Kunau et al.,
1995, Prog. Lipid Res. 34:267-342; Stymne et al., 1993, in:
Biochemistry and Molecular Biology of Membrane and Storage Lipids
of Plants, ed.: Murata and Somerville, Rockville, American
Society of Plant Physiologists, 150-158; Murphy & Ross 1998,
Plant Journal 13(1):1-16.

The PUFAs produced in the process comprise a group of molecules
which higher animals are no longer capable of synthesizing and
must therefore take up, or which higher animals can no longer
synthesize themselves in sufficient amounts and must thus
additionally take them up, although they are synthesized readily
by other organisms such as bacteria; for example, cats are no
longer capable of synthesizing arachidonic acid.

For the purposes of the invention, the terms "desaturase or
elongase" or "desaturase or elongase polypeptide" comprise
proteins which are implicated in the desaturation and elongation
of fatty acids, and their homologs, derivatives or analogs. The
terms desaturase or elongase nucleic acid sequence(s) comprise
nucleic acid sequences which encode a desaturase or elongase and
in which a part can be a coding region and likewise corresponding
5'- and 3'-untranslated sequence regions. The terms production or
productivity are known in the art and comprise the concentration
of the fermentation product (compound of the formula I) which is
formed within a specified period of time and a specified
fermentation volume (for example kg of product per hour per
liter). The term production efficiency comprises the time span
required for obtaining a specific amount of product (for example
the time required by the cell for establishing a certain
throughput rate of a fine chemical). The term yield or
product/carbon yield is known in the art and comprises the
efficiency with which the carbon source is converted into the
product (i.e. the fine chemical). This is usually expressed as,
for example, kg of product per kg of carbon source. Increasing
the yield or production of the compound results in increasing the
amount of resulting molecules or the suitable resulting molecules
of this compound in a certain amount of culture over a specified
period. The terms biosynthesis or biosynthetic pathway are known
in the art and comprise the synthesis of a compound, preferably
an organic compound, by a cell starting from intermediates, for
example in a multi-step process which is strongly regulated. The
terms catabolism or catabolic pathway are known in the art and
comprise the cleavage of a compound, preferably an organic
compound, by a cell to give catabolites (in more general terms,
smaller or less complex molecules), for example in a multi-step
process which is strongly regulated. The term metabolism is known
in the art and comprises the totality of the biochemical
reactions which take place in an organism. The metabolism of a
certain compound (for example the metabolism of a fatty acid)
thus comprises the totality of the biosynthetic pathways,
modified pathways and catabolic pathways of this compound in the
cell which relate to this compound.
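
The units given above for productivity (kg of product per hour per
liter) and for product/carbon yield (kg of product per kg of carbon
source) amount to simple ratios. The following sketch only
illustrates this arithmetic; the function names and the example
figures are hypothetical and are not taken from the application.

```python
def productivity(product_kg, hours, volume_l):
    """Product formed per unit time and fermentation volume, in kg/(h*L)."""
    if hours <= 0 or volume_l <= 0:
        raise ValueError("time and volume must be positive")
    return product_kg / (hours * volume_l)


def carbon_yield(product_kg, carbon_source_kg):
    """Product obtained per amount of carbon source consumed, in kg/kg."""
    if carbon_source_kg <= 0:
        raise ValueError("carbon source amount must be positive")
    return product_kg / carbon_source_kg


if __name__ == "__main__":
    # Hypothetical run: 2 kg of product in 10 h in a 100 L volume,
    # obtained from 8 kg of carbon source.
    print(productivity(2.0, 10.0, 100.0))  # kg per hour per liter
    print(carbon_yield(2.0, 8.0))          # kg product per kg carbon source
```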

In a further embodiment, derivatives of the nucleic acid molecule
according to the invention encode proteins with at least 50%,
advantageously approximately 50 to 60%, preferably at least
approximately 60 to 70% and more preferably at least
approximately 70 to 80%, 80 to 90%, 90 to 95% and most preferably
at least approximately 96%, 97%, 98%, 99% or more homology (=
identity) with a complete amino acid sequence of the SEQ ID
NO: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30 or 32.
The homology of the amino acid sequence can be determined over
the entire sequence region using the program PileUp (J. Mol.
Evolution., 25, 351-360, 1987, Higgins et al., CABIOS, 5,
1989:151-153) or BESTFIT or GAP (Henikoff, S. and Henikoff, J. G.
(1992). Amino acid substitution matrices from protein blocks.
Proc. Natl. Acad. Sci. USA 89: 10915-10919.)

Moreover, the invention comprises nucleic acid molecules which
differ from one of the nucleotide sequences shown in SEQ ID
NO: 1, 3, 5 or 11 (and parts thereof) as the result of the
degeneracy of the genetic code and which thus encode the same
desaturase as the desaturase which is encoded by the nucleotide
sequences shown in SEQ ID NO: 1, 3, 5, 7, 9, 11, 13, 15, 17, 19,
21, 23, 25, 27, 29 or 31.
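
What degeneracy of the genetic code means in practice can be shown
with a short sketch: two different nucleotide sequences are
translated into the same peptide. The codon table below is only a
small subset of the standard genetic code, and the two nine-base
sequences are invented for illustration; they are not taken from
the sequence listing.

```python
# Subset of the standard genetic code (DNA codons -> one-letter amino acids).
CODON_TABLE = {
    "ATG": "M",
    "CTG": "L", "TTA": "L",  # two of the six leucine codons
    "AAA": "K", "AAG": "K",  # both lysine codons
}


def translate(dna):
    """Translate a DNA string codon by codon using CODON_TABLE."""
    if len(dna) % 3 != 0:
        raise ValueError("sequence length must be a multiple of three")
    codons = [dna[i:i + 3] for i in range(0, len(dna), 3)]
    return "".join(CODON_TABLE[codon] for codon in codons)


if __name__ == "__main__":
    first = "ATGCTGAAA"   # hypothetical sequence
    second = "ATGTTAAAG"  # differs at three positions
    print(translate(first), translate(second))
```

Both sequences encode the same three residues although they differ
at three nucleotide positions.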

In addition to the desaturase nucleotide sequences shown in SEQ
ID NO: 1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29 or
31, the skilled worker will recognize that DNA sequence
polymorphisms which result in modifications in the amino acid
sequences of the desaturases or elongases may exist within a
population. These genetic polymorphisms in the desaturase or
elongase gene may exist between individuals within a population
as the result of natural variation. These natural variants
usually bring about a variance of from 1 to 5% in the nucleotide
sequence of the desaturase or elongase gene. Any and all of
these nucleotide variations and resulting amino acid
polymorphisms in the enzyme desaturase or elongase which are the
result of natural variation and which do not modify the
functional activity of desaturases or elongases are also intended
to fall under the scope of the invention.

Nucleic acid molecules which are advantageous for the process
according to the invention can be isolated on the basis of their
homology with the desaturase or elongase nucleic acids disclosed
herein using the sequences or part thereof as hybridization
probe, following standard hybridization techniques under
stringent hybridization conditions. In this context, it is
possible for example to use isolated nucleic acid molecules which
are at least 15 nucleotides in length and which hybridize under
stringent conditions with the nucleic acid molecules which
comprise a nucleotide sequence of SEQ ID NO: 1, 3, 5, 7, 9, 11,
13, 15, 17, 19, 21, 23, 25, 27, 29 or 31. It is also possible to
use nucleic acids with at least 25, 50, 100, 250 or more
nucleotides. The term "hybridizes under stringent conditions" as
used in the present context is understood as describing
hybridization and wash conditions under which nucleotide
sequences with at least 60% homology with one another usually
remain hybridized with one another. The conditions are preferably
such that sequences which are at least approximately 65%, more
preferably at least approximately 70% and even more preferably at
least approximately 75% or more homologous with one another
usually remain hybridized with one another. These stringent
conditions are known to the skilled worker and can be found in
Current Protocols in Molecular Biology, John Wiley & Sons, N. Y.
(1989), 6.3.1-6.3.6. A preferred nonlimiting example of stringent
hybridization conditions is hybridization in 6 x sodium
chloride/sodium citrate (SSC) at approximately 45°C, followed by
one or more wash steps in 0.2 x SSC, 0.1% SDS at 50 to 65°C. The
skilled worker knows that these hybridization conditions differ
depending on the type of the nucleic acid and, for example when
organic solvents are used, with regard to the temperature and
concentration of the buffer. For example, under "standard
hybridization conditions" the temperature differs depending on
the type of nucleic acid between 42°C and 58°C in aqueous buffer
with a concentration of 0.1 to 5 x SSC (pH 7.2). If organic
solvent is present in the abovementioned buffer, for example 50%
formamide, the temperature under standard conditions is
approximately 42°C. The hybridization conditions for DNA:DNA
hybrids preferably are for example 0.1 x SSC and 20°C to 45°C,
preferably between 30°C and 45°C. The hybridization conditions for
DNA:RNA hybrids preferably are for example 0.1 x SSC and 30°C to
55°C, preferably between 45°C and 55°C. The abovementioned
hybridization temperatures are determined for example for a
nucleic acid with a length of approximately 100 bp (= base pairs)
and a G + C content of 50% in the absence of formamide. The
skilled worker knows how to identify the hybridization conditions
required with the aid of textbooks, such as the one mentioned
above, or the following textbooks: Sambrook et al., "Molecular
Cloning", Cold Spring Harbor Laboratory, 1989; Hames and Higgins
(ed.) 1985, "Nucleic Acids Hybridization: A Practical Approach",
IRL Press at Oxford University Press, Oxford; Brown (ed.) 1991,
"Essential Molecular Biology: A Practical Approach", IRL Press at
Oxford University Press, Oxford.
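
The SSC concentrations named above (6 x for hybridization, 0.2 x
for washing) are multiples of a standard buffer strength. As a
purely arithmetic sketch, and assuming that the buffers are diluted
from a 20 x SSC stock (a common laboratory convention, not
something stated in this application), the stock volume follows
from C1 x V1 = C2 x V2. The function name is hypothetical.

```python
def stock_volume_ml(target_x, final_volume_ml, stock_x=20.0):
    """Volume of concentrated stock needed for a given dilution.

    target_x        -- desired strength, e.g. 6.0 for 6 x SSC
    final_volume_ml -- final buffer volume in ml
    stock_x         -- stock strength; 20 x is an assumption
    """
    if not 0 < target_x <= stock_x:
        raise ValueError("target must be positive and not exceed the stock")
    return target_x * final_volume_ml / stock_x


if __name__ == "__main__":
    # 100 ml of 6 x SSC hybridization buffer
    print(stock_volume_ml(6.0, 100.0))
    # 500 ml of 0.2 x SSC wash buffer (SDS would be added separately)
    print(stock_volume_ml(0.2, 500.0))
```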

To determine the percentage homology (= identity) of two amino
acid sequences (for example of the sequences of SEQ ID NO: 2, 4,
6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30 or 32) or of two
nucleic acids (for example one of the sequences of SEQ ID NO: 1,
3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29 or 31), the
sequences are written underneath each other to provide an optimal
comparison (for example, gaps may be introduced into the sequence
of a protein or a nucleic acid in order to generate an optimal
alignment with the other protein or the other nucleic acid). The
amino acid residues or nucleotides at the corresponding amino acid
positions or nucleotide positions are then compared. If a
position in a sequence is occupied by the same amino acid residue
or the same nucleotide as the corresponding position in the other
sequence, the molecules are homologous at this position (i.e.
amino acid or nucleic acid "homology" as used in the present
context corresponds to amino acid or nucleic acid "identity").
The percentage homology between the two sequences is a function
of the number of identical positions which the sequences share
(i.e. percent homology = number of identical positions/total
number of positions x 100). The terms homology and identity are
thus to be regarded as synonymous.
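
The formula above (identical positions divided by total positions,
times 100) can be applied directly to two sequences that have
already been aligned. The sketch below assumes that gaps are
written as "-" and that a gap never counts as an identical
position; both choices, the function name and the example sequences
are illustrative assumptions. Alignment programs such as those
cited in this section may treat gaps differently.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percent identity of two pre-aligned sequences of equal length."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    if not aligned_a:
        raise ValueError("sequences must not be empty")
    identical = sum(
        1
        for x, y in zip(aligned_a, aligned_b)
        if x == y and x != gap
    )
    return 100.0 * identical / len(aligned_a)


if __name__ == "__main__":
    # Hypothetical five-residue alignments
    print(percent_identity("MKTAL", "MKSAL"))  # one substitution
    print(percent_identity("MK-AL", "MKTAL"))  # one gap
```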

An isolated nucleic acid molecule which encodes a desaturase or
elongase which is homologous to a protein sequence of SEQ ID NO:
2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30 or 32 can
be generated by introducing one or more nucleotide substitutions,
additions or deletions into a nucleotide sequence of SEQ ID
NO: 1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29 or 31
so that one or more amino acid substitutions, additions or
deletions are introduced into the protein which is encoded.
Mutations can be introduced into one of the sequences of SEQ ID
NO: 1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29 or 31
by standard techniques such as site-specific mutagenesis and
PCR-mediated mutagenesis. It is preferred to generate
conservative amino acid substitutions at one or more of the
predicted nonessential amino acid residues. In a "conservative
amino acid substitution", the amino acid residue is substituted
by an amino acid residue with a similar side chain. Families of
amino acid residues with similar side chains have been defined
in the art. These families comprise amino acids with basic side
chains (for example lysine, arginine, histidine), acidic side
chains (for example aspartic acid, glutamic acid), uncharged
polar side chains (for example glycine, asparagine, glutamine,
serine, threonine, tyrosine, cysteine), unpolar side chains (for
example alanine, valine, leucine, isoleucine, proline,
phenylalanine, methionine, tryptophan), beta-branched side chains
(for example threonine, valine, isoleucine) and aromatic side
chains (for example tyrosine, phenylalanine, tryptophan,
histidine). A predicted nonessential amino acid residue in a
desaturase or elongase is thus preferably substituted by another
amino acid residue from the same family of side chains. As an
alternative, the mutations can, in a different embodiment, be
introduced randomly over the entire desaturase-encoding sequence
or part thereof, for example by means of saturation mutagenesis,
and the resulting mutants can be screened for the desaturase
activity described herein in order to identify mutants which
retain the desaturase or elongase activity. After the mutagenesis
of one of the sequences of SEQ ID NO: 1, 3, 5, 7, 9, 11, 13, 15,
17, 19, 21, 23, 25, 27, 29 or 31 the encoded protein can be
expressed recombinantly, and the activity of the protein can be
determined for example using the assays described herein.
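
The side-chain families listed above lend themselves to a simple
lookup. The following sketch encodes exactly those families in
one-letter code and treats a substitution as conservative when both
residues share at least one family. The function name is
hypothetical, and the check says nothing about whether a given
residue is in fact nonessential.

```python
# Side-chain families as listed in the text (one-letter amino acid codes).
FAMILIES = {
    "basic": set("KRH"),
    "acidic": set("DE"),
    "uncharged polar": set("GNQSTYC"),
    "unpolar": set("AVLIPFMW"),
    "beta-branched": set("TVI"),
    "aromatic": set("YFWH"),
}


def is_conservative(original, replacement):
    """True if both residues belong to at least one common family."""
    return any(
        original in members and replacement in members
        for members in FAMILIES.values()
    )


if __name__ == "__main__":
    print(is_conservative("K", "R"))  # both basic
    print(is_conservative("T", "V"))  # both beta-branched
    print(is_conservative("D", "K"))  # acidic vs. basic
```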

The invention is illustrated further by the examples which
follow, but which are not to be construed as limiting. The
content of all of the references, patent applications, patents
and published patent applications cited in the present patent
application is herein incorporated by reference.

Examples section

Example 1: General methods 

a) General cloning methods: 

Cloning methods such as, for example, restriction cleavages,
agarose gel electrophoresis, purification of DNA fragments,
transfer of nucleic acids onto nitrocellulose and nylon
membranes, linking of DNA fragments, transformation of
Escherichia coli and yeast cells, bacterial cultures and sequence
analysis of recombinant DNA were carried out as described in
Sambrook et al. (1989) (Cold Spring Harbor Laboratory Press:
ISBN 0-87969-309-6) or Kaiser, Michaelis and Mitchell (1994)
"Methods in Yeast Genetics" (Cold Spring Harbor Laboratory Press:
ISBN 0-87969-451-3).

b) Chemicals 

Unless otherwise stated in the text, the chemicals used were
obtained in analytical-grade quality from Fluka (Neu-Ulm), Merck
(Darmstadt), Roth (Karlsruhe), Serva (Heidelberg) and
Sigma (Deisenhofen). Solutions were made with purified,
pyrogen-free water, hereinbelow referred to as H2O, from a Milli-Q
Water System water purification system (Millipore, Eschborn).
Restriction endonucleases, DNA-modifying enzymes and molecular
biology kits were obtained from AGS (Heidelberg), Amersham
(Braunschweig), Biometra (Göttingen), Boehringer (Mannheim),
Genomed (Bad Oeynhausen), New England Biolabs
(Schwalbach/Taunus), Novagen (Madison, Wisconsin, USA),
Perkin-Elmer (Weiterstadt), Pharmacia (Freiburg), Qiagen (Hilden)
and Stratagene (Amsterdam, Netherlands). Unless otherwise
specified, they were used in accordance with the manufacturer's
instructions.



Example 2: Isolation of total RNA and poly(A)+ RNA from plants

Total RNA is isolated from plants such as linseed and oilseed
rape and the like following a method described by Logemann et al.
(1987, Anal. Biochem. 163, 21). The total RNA can be obtained
from protonemal tissue from moss using the GTC method (Reski
et al., 1994, Mol. Gen. Genet., 244:352-359).

Example 3: Transformation of Agrobacterium 

The Agrobacterium-mediated transformation of plants can be
carried out for example using the Agrobacterium tumefaciens
strain GV3101 (pMP90) (Koncz and Schell, Mol. Gen. Genet. 204
(1986) 383-396) or LBA4404 (Clontech) or C58C1 pGV2260 (Deblaere
et al. 1984, Nucl. Acids Res. 13, 4777-4788). The transformation
can be carried out by standard transformation techniques (also
Deblaere et al. 1984).

Example 4: Plant transformation 

The Agrobacterium-mediated transformation of plants can be
carried out using standard transformation and regeneration
techniques (Gelvin, Stanton B., Schilperoort, Robert A., Plant
Molecular Biology Manual, 2nd ed., Dordrecht: Kluwer Academic
Publ., 1995, in Sect., Ringbuch Zentrale Signatur: BT11-P, ISBN
0-7923-2731-4; Glick, Bernard R., Thompson, John E., Methods in
Plant Molecular Biology and Biotechnology, Boca Raton: CRC Press,
1993, 360 pp., ISBN 0-8493-5164-2).

Oilseed rape can be transformed by means of cotyledon or
hypocotyl transformation (Moloney et al., Plant Cell 8 (1989)
238-242; De Block et al., Plant Physiol. 91 (1989) 694-701). The
use of antibiotics for the selection of agrobacteria and plants
depends on the Agrobacterium strain and the binary vector used
for the transformation. Normally, oilseed rape is selected using
kanamycin as selectable plant marker.

The Agrobacterium-mediated gene transfer into linseed (Linum
usitatissimum) can be carried out using for example a technique
described by Mlynarova et al. (1994) Plant Cell Report
13:282-285.

The transformation of soya can be carried out using for example a
technique described in EP-A-0 0424 047 (Pioneer Hi-Bred
International) or in EP-A-0 0397 687, US 5,376,543, US 5,169,770
(University Toledo).

The transformation of plants using particle bombardment,
polyethylene glycol mediated DNA uptake or via the silicon
carbonate fiber technique is described for example by Freeling
and Walbot "The maize handbook" (1993, ISBN 3-540-97826-7,
Springer Verlag New York).

Example 5: Plasmids for plant transformation 

Binary vectors such as pBinAR (Höfgen and Willmitzer, Plant
Science 66 (1990) 221-230) or pGPTV (Becker et al. 1992, Plant
Mol. Biol. 20:1195-1197) can be used for plant transformation.
The binary vectors can be constructed by ligating the cDNA in
sense or antisense orientation into T-DNA. 5' of the cDNA, a
plant promoter activates cDNA transcription. A polyadenylation
sequence is located 3' of the cDNA. The binary vectors can bear
different marker genes. In particular, the nptII marker gene,
which encodes kanamycin resistance conferred by neomycin
phosphotransferase, can be substituted by the herbicide-resistant
form of an acetolactate synthase gene (AHAS or ALS). The ALS gene
is described in Ott et al., J. Mol. Biol. 1996, 263:359-360. The
v-ATPase-c1 promoter can be cloned into plasmid pBin19 or pGPTV
and used for the expression of the marker gene by cloning
upstream of the ALS coding region. The abovementioned promoter
corresponds to a 1153 base-pair fragment from Beta vulgaris
(Plant Mol Biol, 1999, 39:463-475). In this context,
imidazolinones such as imazethapyr, or sulfonylureas, may be
used as antimetabolites for the selection.

Tissue-specific expression can be achieved using a
tissue-specific promoter. For example, seed-specific expression
can be achieved by cloning the DC3 or LeB4 or USP promoter or the
phaseolin promoter 5' of the cDNA. However, any other
seed-specific promoter element such as, for example, the napin or
arcelin promoter (Goossens et al. 1999, Plant Phys.
120(4):1095-1103 and Gerhardt et al. 2000, Biochimica et
Biophysica Acta 1490(1-2):87-98) may also be used. The CaMV-35S
promoter or a v-ATPase c1 promoter can be used for constitutive
expression in the intact plants.

In particular, genes encoding desaturases and elongases can be
cloned into a binary vector one after the other by constructing a
plurality of expression cassettes in order to mimic the metabolic
pathway in plants.

Within an expression cassette, the protein to be expressed can be
targeted into a cellular compartment using a signal peptide, for
example for plastids, mitochondria or the endoplasmic reticulum
(Kermode, Crit. Rev. Plant Sci. 15, 4 (1996) 285-423). The signal
peptide is cloned 5' in the reading frame with the cDNA to
achieve the subcellular localization of the fusion protein.

Examples of multiexpression cassettes are given hereinbelow.

I.) Promoter-terminator cassettes 

Expression cassettes consist of at least two functional units such
as a promoter and a terminator. Further desired gene sequences
such as targeting sequences, coding regions of genes or parts
thereof and the like can be inserted between promoter and
terminator. To construct expression cassettes, promoters and
terminators (USP promoter: Baeumlein et al., Mol Gen Genet, 1991,
225 (3):459-67; OCS terminator: Gielen et al. EMBO J. 3 (1984)
835ff.) are isolated with the aid of the polymerase chain
reaction and tailor-made with flanking sequences of choice on the
basis of synthetic oligonucleotides.

Examples of oligonucleotides which can be used are the following:

USP1 upstream: CCGGAATTCGGCGCGCCGAGCTCCTCGAGCAAATTTACACATTGCCA
USP2 upstream: CCGGAATTCGGCGCGCCGAGCTCCTCGAGCAAATTTACACATTGCCA
USP3 upstream: CCGGAATTCGGCGCGCCGAGCTCCTCGAGCAAATTTACACATTGCCA
USP1 downstream: AAAACTGCAGGCGGCCGCCCACCGCGGTGGGCTGGCTATGAAGAAATT
USP2 downstream: CGCGGATCCGCTGGCTATGAAGAAATT
USP3 downstream: TCCCCCGGGATCGATGCCGGCAGATCTGCTGGCTATGAAGAAATT
OCS1 upstream: AAAACTGCAGTCTAGAAGGCCTCCTGCTTTAATGAGATAT
OCS2 upstream: CGCGGATCCGATATCGGGCCCGCTAGCGTTAACCCTGCTTTAATGAGATAT
OCS3 upstream: TCCCCCGGGCCATGGCCTGCTTTAATGAGATAT
OCS1 downstream: CCCAAGCTTGGCGCGCCGAGCTCGAATTCGTCGACGGACAATCAGTAAATTGA
OCS2 downstream: CCCAAGCTTGGCGCGCCGAGCTCGAATTCGTCGACGGACAATCAGTAAATTGA
OCS3 downstream: CCCAAGCTTGGCGCGCCGAGCTCGTCGACGGACAATCAGTAAATTGA
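
The flanking sequences carried by these oligonucleotides can be
inspected with a short sketch that searches a primer for
restriction enzyme recognition sequences. The recognition
sequences below are the commonly documented ones for the named
enzymes and are supplied here as an assumption, since the
application does not list them; the function name is hypothetical.

```python
# Commonly documented recognition sequences (all palindromic).
SITES = {
    "EcoRI": "GAATTC",
    "AscI": "GGCGCGCC",
    "SacI": "GAGCTC",
    "XhoI": "CTCGAG",
    "SalI": "GTCGAC",
    "HindIII": "AAGCTT",
}

# Two primers copied from the list above.
USP1_UPSTREAM = "CCGGAATTCGGCGCGCCGAGCTCCTCGAGCAAATTTACACATTGCCA"
OCS1_DOWNSTREAM = "CCCAAGCTTGGCGCGCCGAGCTCGAATTCGTCGACGGACAATCAGTAAATTGA"


def sites_in_order(primer):
    """Enzyme names whose site occurs in the primer, ordered 5' to 3'.

    Only the first occurrence of each site is considered.
    """
    hits = [
        (primer.find(sequence), name)
        for name, sequence in SITES.items()
        if sequence in primer
    ]
    return [name for _position, name in sorted(hits)]


if __name__ == "__main__":
    print(sites_in_order(USP1_UPSTREAM))
    print(sites_in_order(OCS1_DOWNSTREAM))
```

The downstream primer is read on the opposite strand, so its sites
appear in the finished cassette in the reverse of the order
reported here.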




The methods are known to the specialist worker and are generally 
known from the literature. 

In a first step, a promoter and a terminator are amplified via
PCR. Then, the terminator is cloned into a recipient plasmid and,
in a second step, the promoter is inserted upstream of the
terminator. This gives an expression cassette on a plasmid
vehicle. The plasmids pUT1, pUT2 and pUT3 are generated on the
basis of the plasmid pUC19.

The constructs are defined in accordance with the invention in
SEQ ID NO: 33, 34 and 42. They comprise the USP promoter and the
OCS terminator. Based on these plasmids, the construct pUT12 is
generated by cutting pUT1 with SalI/ScaI and cutting pUT2 with
XhoI/ScaI. The fragments comprising the expression cassettes are
ligated and transformed into E. coli XL1 blue MRF'. After picking
out ampicillin-resistant colonies, DNA is prepared, and those
clones which comprise two expression cassettes are identified by
restriction analysis. The XhoI/SalI ligation of compatible ends
has eliminated the two cleavage sites XhoI and SalI between the
expression cassettes. This gives rise to plasmid pUT12, which is
defined in SEQ ID NO: 36. pUT12 is subsequently cut again with
SalI/ScaI and pUT3 with XhoI/ScaI. The fragments comprising the
expression cassettes are ligated and transformed into E. coli XL1
blue MRF'. After singling out ampicillin-resistant colonies, DNA
is prepared, and those clones which comprise three expression
cassettes are identified by restriction analysis. In this manner,
a set of multiexpression cassettes is created which can be
exploited for inserting the desired DNA and is described in Table
1 and can additionally incorporate further expression cassettes.

They comprise the following elements:

Table 1

pUC19 derivative | Cleavage sites before the USP promoter | Multiple cloning cleavage sites | Cleavage sites behind the OCS terminator
-----------------|----------------------------------------|---------------------------------|------------------------------------------
pUT1 | EcoRI/AscI/SacI/XhoI | BstXI/NotI/PstI/XbaI/StuI | SalI/EcoRI/SacI/AscI/HindIII
pUT2 | EcoRI/AscI/SacI/XhoI | BamHI/EcoRV/ApaI/NheI/HpaI | SalI/EcoRI/SacI/AscI/HindIII
pUT3 | EcoRI/AscI/SacI/XhoI | BglII/NaeI/ClaI/SmaI/NcoI | SalI/SacI/AscI/HindIII
pUT12 (double expression cassette) | EcoRI/AscI/SacI/XhoI | BstXI/NotI/PstI/XbaI/StuI and BamHI/EcoRV/ApaI/NheI/HpaI | SalI/EcoRI/SacI/AscI/HindIII
pUT123 (triple expression cassette) | EcoRI/AscI/SacI/XhoI | 1. BstXI/NotI/PstI/XbaI/StuI and 2. BamHI/EcoRV/ApaI/NheI/HpaI and 3. BglII/NaeI/ClaI/SmaI/NcoI | SalI/SacI/AscI/HindIII
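
The loss of the XhoI and SalI sites on joining two cassettes, as
mentioned in the cloning description, follows from the way the two
enzymes cut. The sketch below assumes the commonly documented cut
positions (SalI G^TCGAC, XhoI C^TCGAG), which are not stated in
this application; the helper name is hypothetical.

```python
SALI_SITE = "GTCGAC"  # SalI, assumed to cut after the first base
XHOI_SITE = "CTCGAG"  # XhoI, assumed to cut after the first base
CUT_OFFSET = 1        # both enzymes leave a 5'-TCGA overhang


def ligated_junction(left_site, right_site, cut_offset=CUT_OFFSET):
    """Sequence formed when the left part of one cut site is joined
    to the right part of another cut site with a compatible end."""
    return left_site[:cut_offset] + right_site[cut_offset:]


if __name__ == "__main__":
    # Both overhangs are identical, so the ends are compatible.
    print(SALI_SITE[1:5], XHOI_SITE[1:5])
    # SalI-cut end of the first cassette joined to the XhoI-cut end
    # of the second cassette.
    junction = ligated_junction(SALI_SITE, XHOI_SITE)
    print(junction, junction in (SALI_SITE, XHOI_SITE))
```

The junction is a hybrid sequence that matches neither recognition
site, so neither enzyme can cut between the cassettes afterwards.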

Furthermore, further multiexpression cassettes can be generated
and employed for seed-specific gene expression, as described and
as specified in greater detail in Table 2, with the aid of the

i) USP promoter or with the aid of the

ii) 700 base pair 3' fragment of the LeB4 promoter or with the
aid of the

iii) DC3 promoter.

The DC3 promoter is described in Thomas, Plant Cell 1996,
263:359-368 and consists merely of the region -117 to +26, which
is why it constitutes one of the smallest known
seed-specific promoters. The expression cassettes can comprise
several copies of the same promoter or else be constructed via
three different promoters.

The vectors used for the transformation of plants and the
sequences of the inserted genes/proteins can be found in sequence
listing SEQ ID NO: 43 to 49.

Advantageously used polylinker or polylinker-terminator-polylinkers
can be found in the sequences SEQ ID NO: 50 to 52.




Table 2: Multiple expression cassettes

Plasmid name of the pUC19 derivative | Cleavage sites before the respective promoter | Multiple cloning cleavage sites | Cleavage sites behind the OCS terminator
-------------------------------------|-----------------------------------------------|---------------------------------|------------------------------------------
pUT1 (pUC19 with USP-OCS1) | EcoRI/AscI/SacI/XhoI | (1) BstXI/NotI/PstI/XbaI/StuI | SalI/EcoRI/SacI/AscI/HindIII
pDCT (pUC19 with DC3-OCS) | EcoRI/AscI/SacI/XhoI | (2) BamHI/EcoRV/ApaI/NheI/HpaI | HindIII
pUBT (pUC19 with LeB4(700)-OCS) | EcoRI/AscI/SacI/XhoI | (3) BglII/NaeI/ClaI/SmaI/NcoI | SalI/SacI/AscI/HindIII
pUD12 (pUC19 with USP-OCS1 and with DC3-OCS) | EcoRI/AscI/SacI/XhoI | (1) BstXI/NotI/PstI/XbaI/StuI and (2) BamHI/EcoRV/ApaI/NheI/HpaI | SalI/EcoRI/SacI/AscI/HindIII
pUDL123, triple expression cassette (pUC19 with USP/DC3 and LeB4-700) | EcoRI/AscI/SacI/XhoI | (1) BstXI/NotI/PstI/XbaI/StuI and (2) BamHI/(EcoRV*)/ApaI/NheI/HpaI and (3) BglII/NaeI/ClaI/SmaI/NcoI | SalI/SacI/AscI/HindIII

* EcoRV cleavage site in the 700 base-pair fragment of the LeB4
promoter (LeB4-700)



Further promoters for multi-gene constructs can be generated
analogously, in particular using the

a) 2.7 kb fragment of the LeB4 promoter or with the aid of the

b) phaseolin promoter or with the aid of the

c) constitutive v-ATPase c1 promoter.

It may be particularly desirable to use further especially
suitable promoters for constructing seed-specific
multi-expression cassettes such as, for example, the napin
promoter or the arcelin-5 promoter.

II) Generation of expression constructs which comprise promoter,
terminator and desired gene sequence for the expression of
PUFA genes in plant expression cassettes.

In pUT123, the Δ6-elongase Pp_PSE1 is first inserted into the
first cassette via BstXI and XbaI. Then, the moss Δ6-desaturase
(Pp_des6) is inserted into the second cassette via BamHI/NaeI,
and, finally, the Phaeodactylum Δ5-desaturase (Pt_des5) is
inserted into the third cassette via BglII/NcoI. The triple
construct is named pARA1. Taking into consideration
sequence-specific restriction cleavage sites, the further
expression cassettes shown in Table 3, which are named pARA2,
pARA3 and pARA4, can be generated.

Table 3: Combinations of desaturases and elongases

Gene plasmid | Δ6-Desaturase | Δ5-Desaturase | Δ6-Elongase
-------------|---------------|---------------|------------
pARA1 | Pp_des6 | Pt_des5 | Pp_PSE1
pARA2 | Pt_des6 | Pt_des5 | Pp_PSE1
pARA3 | Pt_des6 | Ce_des5 | Pp_PSE1
pARA4 | Ce_des6 | Ce_des5 | Ce_PSE1

Pp = Physcomitrella patens, Pt = Phaeodactylum tricornutum
Pp_PSE1 corresponds to the sequence of SEQ ID NO: 9.
PSE = PUFA-specific Δ6-elongase
Ce_des5 = Δ5-desaturase from Caenorhabditis elegans (Genbank
Acc. No. AF078796)
Ce_des6 = Δ6-desaturase from Caenorhabditis elegans
(Genbank Acc. No. AF031477, bases 11-1342)
Ce_PSE1 = Δ6-elongase from Caenorhabditis elegans (Genbank
Acc. No. AF244356, bases 1-867)
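
The combinations in Table 3 can also be held as a small data
structure, for example to look up which source organisms contribute
to a given construct. The sketch below simply transcribes the table
and its legend; the dictionary keys and the function name are
hypothetical.

```python
# Table 3 transcribed: construct -> gene used for each enzyme activity.
CONSTRUCTS = {
    "pARA1": {"d6_desaturase": "Pp_des6", "d5_desaturase": "Pt_des5",
              "d6_elongase": "Pp_PSE1"},
    "pARA2": {"d6_desaturase": "Pt_des6", "d5_desaturase": "Pt_des5",
              "d6_elongase": "Pp_PSE1"},
    "pARA3": {"d6_desaturase": "Pt_des6", "d5_desaturase": "Ce_des5",
              "d6_elongase": "Pp_PSE1"},
    "pARA4": {"d6_desaturase": "Ce_des6", "d5_desaturase": "Ce_des5",
              "d6_elongase": "Ce_PSE1"},
}

# Organism prefixes from the table legend.
ORGANISMS = {
    "Pp": "Physcomitrella patens",
    "Pt": "Phaeodactylum tricornutum",
    "Ce": "Caenorhabditis elegans",
}


def source_organisms(construct):
    """Sorted list of organisms contributing genes to a construct."""
    genes = CONSTRUCTS[construct].values()
    return sorted({ORGANISMS[gene.split("_")[0]] for gene in genes})


if __name__ == "__main__":
    for name in CONSTRUCTS:
        print(name, source_organisms(name))
```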

Further desaturase or elongase sequences can also be inserted
into the expression cassettes in the described manner, such as,
for example, Genbank Acc. No. AF231981, NM_013402, AF206662,
AF268031, AF226273, AF110510 or AF110509.

III) Transfer of expression cassettes into vectors for the
transformation of Agrobacterium tumefaciens and for the
transformation of plants

The constructs generated thus are inserted into the binary vector
pGPTV by means of AscI. For this purpose, the multiple cloning
sequence is extended by an AscI cleavage site: the polylinker is
synthesized de novo as two double-stranded oligonucleotides,
thereby introducing an additional AscI DNA sequence. The
oligonucleotide is inserted into the vector pGPTV by means of
EcoRI and HindIII. The cloning techniques required are known to
the skilled worker and can simply be found in the literature as
described in Example 1.

Example 6: Studying the expression of a recombinant gene product
in a transformed organism

The activity of a recombinant gene product in the transformed
host organism can be measured at the transcriptional and/or the
translational level.

A suitable method for determining the extent to which the gene is
transcribed (which indicates the amount of RNA which is available
for the translation of the gene product) is to carry out a
Northern blot as detailed hereinbelow (as reference, see Ausubel
et al. (1988) Current Protocols in Molecular Biology, Wiley: New
York, or the abovementioned Examples Section). Here, a primer
which is designed in such a way that it binds to the gene of
interest is labeled with a detectable label (usually a
radioactive label or a chemiluminescent label). When the total
RNA of a culture of the organism is extracted, separated on a
gel, transferred onto a stable matrix and incubated with this
probe, the binding and the extent of the binding of the probe
indicate the existence and also the amount of the mRNA for this
gene. This information indicates the degree to which the
transformed gene has been transcribed. Cellular total RNA can be
prepared from cells, tissues or organs using a plurality of
methods, all of which are known in the art, such as, for example,
the method of Bormann, E.R., et al. (1992) Mol. Microbiol.
6:317-326.

Northern hybridization:

To carry out the RNA hybridization, 20 µg of total RNA or 1 µg of
poly(A)+ RNA were separated by gel electrophoresis in agarose gels
with a strength of 1.25% using formaldehyde, as described in
Amasino (1986, Anal. Biochem. 152, 304), capillary-blotted onto
positively charged nylon membranes (Hybond N+, Amersham,
Braunschweig) using 10 x SSC, immobilized using UV light and
prehybridized for 3 hours at 68°C using hybridization buffer (10%
dextran sulfate w/v, 1 M NaCl, 1% SDS, 100 mg herring sperm DNA).
The DNA probe was labeled with the Highprime DNA labeling kit
(Roche, Mannheim, Germany) during the prehybridization step,
using alpha-32P-dCTP (Amersham, Braunschweig, Germany). After the
labeled DNA probe had been added, the hybridization was carried
out overnight at 68°C in the same buffer. The wash steps were
carried out twice for 15 minutes using 2 x SSC and twice for
30 minutes using 1 x SSC, 1% SDS, at 68°C. The sealed filters were
exposed at -70°C for a period of 1 to 14 days.

Standard techniques, such as a Western blot, can be employed for
studying the presence or the relative amount of protein
translated by this mRNA (see, for example, Ausubel et al. (1988)
Current Protocols in Molecular Biology, Wiley: New York). In this
method, the cellular total proteins are extracted, separated by
gel electrophoresis, transferred to a matrix such as
nitrocellulose and incubated with a probe such as an antibody
which binds specifically to the desired protein. This probe is
usually provided with a chemiluminescent or colorimetric label
which can be detected readily. The presence and the amount of the
label observed indicate the presence and the amount of the
desired mutated protein present in the cell.

Example 7: Analysis of the effect of the recombinant proteins on
the production of the desired product

The effect of the genetic modification in plants, fungi, algae or
ciliates on the production of a desired compound (such as a
fatty acid) can be determined by growing the modified
microorganisms or the modified plant under suitable conditions
(such as those described above) and analyzing the medium and/or
the cellular components for the increased production of the
desired product (i.e. of lipids or a fatty acid). These
analytical techniques are known to the skilled worker and
comprise spectroscopy, thin-layer chromatography, various types of
staining methods, enzymatic and microbiological methods, and
analytical chromatography such as high-performance liquid
chromatography (see, for example, Ullmann, Encyclopedia of
Industrial Chemistry, vol. A2, pp. 89-90 and pp. 443-613, VCH:
Weinheim (1985); Fallon, A., et al. (1987) "Applications of HPLC
in Biochemistry" in: Laboratory Techniques in Biochemistry and
Molecular Biology, vol. 17; Rehm et al. (1993) Biotechnology,
vol. 3, chapter III: "Product recovery and purification", pp.
469-714, VCH: Weinheim; Belter, P.A., et al. (1988)
Bioseparations: downstream processing for Biotechnology, John
Wiley and Sons; Kennedy, J.F. and Cabral, J.M.S. (1992) Recovery
processes for biological Materials, John Wiley and Sons;
Shaeiwitz, J.A., and Henry, J.D. (1988) Biochemical Separations,
in: Ullmann's Encyclopedia of Industrial Chemistry, vol. B3;
chapter 11, pp. 1-27, VCH: Weinheim; and Dechow, F.J. (1989)
Separation and purification techniques in biotechnology, Noyes
Publications).

In addition to the abovementioned methods, plant lipids are
extracted from plant material as described by Cahoon et al.
(1999) Proc. Natl. Acad. Sci. USA 96 (22): 12935-12940, and Browse
et al. (1986) Analytic Biochemistry 152:141-145. The qualitative
and quantitative lipid and fatty acid analysis is described by
Christie, William W., Advances in Lipid Methodology,
Ayr/Scotland: Oily Press (Oily Press Lipid Library; 2); Christie,
William W., Gas Chromatography and Lipids. A Practical Guide
- Ayr, Scotland: Oily Press, 1989, Repr. 1992, IX, 307 pp. (Oily
Press Lipid Library; 1); "Progress in Lipid Research", Oxford:
Pergamon Press, 1 (1952) - 16 (1977) under the title: Progress in
the Chemistry of Fats and Other Lipids, CODEN.

To determine the overall efficiency with which the compound is
produced, it is also possible, in addition to measuring the
fermentation end product, to analyze other components of the
metabolic pathways which are used for producing the desired
compounds, such as intermediates and secondary products. The
analytical methods comprise measurements of the nutrient
quantities in the medium (for example sugars, hydrocarbons,
nitrogen sources, phosphate and other ions), measurements of the
biomass composition and the growth, analysis of the production of
usual metabolites via biosynthetic pathways, and measurements of
gases which are generated during the fermentation process.
Standard methods for these measurements are described in Applied
Microbial Physiology: A Practical Approach, P.M. Rhodes and
P.F. Stanbury, ed., IRL Press, pp. 103-129; 131-163 and 165-192
(ISBN: 0199635773) and references cited therein.

One example is the analysis of fatty acids (abbreviations: FAMEs,
fatty acid methyl esters; GC-MS, gas liquid chromatography/mass
spectrometry; TAG, triacylglycerol; TLC, thin-layer
chromatography).

Unequivocal proof for the presence of fatty acid products can be
obtained by the analysis of recombinant organisms following
standard analytical procedures: GC, GC-MS or TLC as variously
described by Christie and references therein (1997, in: Advances
on Lipid Methodology, Fourth ed.: Christie, Oily Press, Dundee,
119-169; 1998, gas-chromatography/mass-spectrometry methods,
Lipids 33:343-353).

Material to be analyzed can be disintegrated via sonication,
glass milling, liquid nitrogen and grinding or via other
applicable methods. The material has to be centrifuged after
disintegration. The sediment is resuspended in distilled water,
heated for 10 min at 100°C, cooled on ice and centrifuged again,
followed by extraction in 0.5 M sulfuric acid in methanol
containing 2% dimethoxypropane for 1 h at 90°C, leading to
hydrolyzed oil and lipid compounds, resulting in transmethylated
lipids. These fatty acid methyl esters are extracted in petroleum
ether and finally subjected to GC analysis using a capillary
column (Chrompack, WCOT Fused Silica, CP-Wax-52 CB, 25 µm,
0.32 mm) at a temperature gradient between 170°C and 240°C for
20 min and 5 min at 240°C. The identity of the resulting fatty
acid methyl esters has to be defined by the use of standards
available from commercial sources (e.g. Sigma).
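
Defining peak identity by the use of standards amounts to matching each sample peak to the standard with the closest retention time, within some tolerance. A minimal sketch of that comparison follows; the function name identify_peaks, the tolerance of 0.05 min and all retention times are invented here purely for illustration and are not taken from this document.

```python
def identify_peaks(sample_rts, standard_rts, tolerance=0.05):
    """Assign each sample retention time (min) to the nearest standard.

    A peak with no standard within `tolerance` minutes is reported as None,
    i.e. its identity would have to be shown by other means (e.g. GC-MS).
    """
    assignments = {}
    for rt in sample_rts:
        # Nearest standard by absolute retention-time difference.
        name, ref = min(standard_rts.items(), key=lambda kv: abs(kv[1] - rt))
        assignments[rt] = name if abs(ref - rt) <= tolerance else None
    return assignments


if __name__ == "__main__":
    # Hypothetical retention times of FAME standards (minutes).
    standards = {"16:0": 10.20, "18:2": 13.10, "20:4": 16.80}
    print(identify_peaks([10.22, 13.08, 15.00], standards))
```

In this made-up run the peak at 15.00 min matches no standard and stays unassigned, which corresponds to the case in which identity has to be shown via derivatization and GC-MS.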

In the case of fatty acids where standards are not available,
molecule identity has to be shown via derivatization and
subsequent GC-MS analysis. For example, the localization of triple
bond fatty acids has to be shown via GC-MS after derivatization
via 4,4-dimethoxyoxazoline derivatives (Christie, 1998, see
above).

Expression constructs in heterologous microbial systems

Strains, Growth Conditions and Plasmids

Escherichia coli strain XL1 Blue MRF' kan (Stratagene) was used
for sub-cloning the new elongase pPDesaturase1 from
Physcomitrella patens. For functional expression of this gene we
used the Saccharomyces cerevisiae strain INVSc 1 (Invitrogen
Co.). E. coli was grown in Luria-Bertani broth (LB, Duchefa,
Haarlem, The Netherlands) at 37°C. When necessary, ampicillin
(100 mg/liter) was added and 1.5% (w/v) agar was included for
solid LB media. S. cerevisiae was grown at 30°C either in
YPG medium or in complete minimal dropout uracil medium (CMdum;
see in: Ausubel, F.M., Brent, R., Kingston, R.E., Moore, D.D.,
Seidman, J.G., Smith, J.A., Struhl, K., Albright, L.B.,
Coen, D.M., and Varki, A. (1995) Current Protocols in Molecular
Biology, John Wiley & Sons, New York) containing either 2% (w/v)
raffinose or glucose. For solid media 2% (w/v) Bacto™ agar
(Difco) was included. Plasmids used for cloning and expression
were pUC18 (Pharmacia) and pYES2 (Invitrogen Co.).

Example 8: Cloning and expression of PUFA-specific desaturases
and elongases

For expression in plants, cDNA clones from SEQ ID NO: 1, 3, 5, 7,
9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31 were modified in
such a way that only the coding region was amplified by means of
polymerase chain reaction using two oligonucleotides. Care was
taken that a consensus sequence before the start codon was
maintained for efficient translation. To this end, either the
base sequence ATA or AAA was chosen and introduced into the
sequence before the ATG (Kozak, M. (1986) Point mutations define
a sequence flanking the AUG initiator codon that modulates
translation by eukaryotic ribosomes, Cell 44, 283-292). In
addition, a restriction cleavage site was introduced before this
consensus triplet. This restriction cleavage site must be
compatible with the cleavage site of the target vector into which
the fragment is to be cloned and with the aid of which the
expression of genes in microorganisms or plants is to take place.
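
The layout of such a forward oligonucleotide (restriction site, then the consensus triplet, then the start of the coding region) can be sketched as a simple string concatenation. The example below is a hedged illustration only: the function name forward_primer, the choice of a BamHI site (GGATCC) and the 18-base annealing length are assumptions, and the actual oligonucleotides used are not given in this section. The coding-region bases are the first codons of SEQ ID NO: 3.

```python
def forward_primer(cds, site, triplet="AAA", n_bases=18):
    """Assemble restriction site + consensus triplet + 5' end of the coding region.

    `triplet` must be one of the two consensus triplets named in the text
    (ATA or AAA); `n_bases` is an illustrative annealing length.
    """
    triplet = triplet.upper()
    if triplet not in ("ATA", "AAA"):
        raise ValueError("consensus triplet must be ATA or AAA")
    cds = cds.upper()
    if not cds.startswith("ATG"):
        raise ValueError("coding region must begin with the ATG start codon")
    return site.upper() + triplet + cds[:n_bases]


if __name__ == "__main__":
    # First 21 bases of the coding region of SEQ ID NO: 3 (positions 58-78).
    cds_start = "atggaggtcgtggagagattc"
    print(forward_primer(cds_start, "GGATCC"))
```

In practice a few extra bases are often placed 5' of the restriction site so that the enzyme cuts efficiently near the fragment end; the sketch omits them.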

The PCR reaction was performed with plasmid DNA as template in a
Thermocycler (Biometra) using the Pfu DNA polymerase (Stratagene)
and the following temperature programme: 3 minutes at 96°C,
followed by 30 cycles with 30 seconds at 96°C, 30 seconds at 55°C
and 2 minutes at 72°C, 1 cycle with 10 minutes at 72°C and stop at
4°C. The annealing temperature was varied, depending on the
oligonucleotides chosen. A synthesis time of approximately one
minute can be assumed per kilobase pair of DNA. Further parameters
which have an effect on the PCR such as, for example, Mg ions,
salt, DNA polymerase and the like are known to the specialist
worker and can be varied as required.
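
The rule of roughly one minute of synthesis time per kilobase pair, and the overall length of the temperature programme, can be worked out as follows. This is a sketch under stated assumptions: the function names are hypothetical, rounding the synthesis time up to whole minutes is a choice made here, and heating and cooling ramps of the thermocycler are ignored.

```python
import math


def extension_minutes(amplicon_bp, minutes_per_kb=1.0):
    """Synthesis time per cycle, rounded up to whole minutes (assumed rounding)."""
    return math.ceil(amplicon_bp / 1000 * minutes_per_kb)


def program_minutes(cycles=30, denature_s=30, anneal_s=30, extend_min=2,
                    initial_min=3, final_min=10):
    """Total hold time of the programme described above, excluding ramp times."""
    per_cycle = (denature_s + anneal_s) / 60 + extend_min
    return initial_min + cycles * per_cycle + final_min


if __name__ == "__main__":
    # Coding regions of SEQ ID NO: 1 (positions 42-1388) and SEQ ID NO: 3 (58-930).
    print(extension_minutes(1388 - 42 + 1))  # 1347 bp
    print(extension_minutes(930 - 58 + 1))   # 873 bp
    print(program_minutes())
```

With these assumptions the 2 minutes at 72°C in the programme cover the longest coding region listed in this section, and the hold times add up to 103 minutes.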

The correct size of the amplified DNA fragment was verified by
agarose-TBE gel electrophoresis. The amplified DNA was extracted
from the gel using the QIAquick Gel Extraction Kit (QIAGEN) and
ligated into the SmaI restriction site of the dephosphorylated
vector pUC18 using the Sure Clone Ligation Kit (Pharmacia),
giving rise to the pUC derivatives. After the transformation of
E. coli XL1 Blue MRF' kan, a DNA miniprep (Riggs, M.G., &
McLachlan, A. (1986) A simplified screening procedure for large
numbers of plasmid mini-preparation. BioTechniques 4, 310-313)
was carried out on ampicillin-resistant transformants, and
positive clones were identified by means of BamHI restriction
analysis. The sequence of the cloned PCR product was verified by
resequencing using the ABI PRISM Big Dye Terminator Cycle
Sequencing Ready Reaction Kit (Perkin-Elmer, Weiterstadt).

Fatty acid analysis

The total fatty acids were extracted from plant seeds and
analyzed by gas chromatography.

The seeds were taken up in 1% sodium methoxide in methanol and
incubated for 20 minutes at RT. Thereafter, the mixture was washed
with NaCl solution, and the FAMEs were taken up in 0.3 ml heptane.
The samples were separated on a ZEBRON-ZB-Wax capillary column
(30 m, 0.32 mm, 0.25 µm; Phenomenex) in a Hewlett Packard-6850 gas
chromatograph with flame ionization detector. The oven
temperature was programmed from 70°C (1 minute hold) to 200°C at a
rate of 20°C/minute, then to 250°C (5 min hold) at a rate of
5°C/min and finally to 260°C at a rate of 5°C/min. Nitrogen was
used as carrier gas (4.5 ml/min at 70°C). The fatty acids were
identified by comparing the retention times with those of FAME
standards (SIGMA).
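
The run time implied by this oven programme follows from the ramp rates and hold times. The short calculation below is a sketch; the function name oven_program_minutes and the step encoding are assumptions, and no hold at 260°C is included because none is stated.

```python
def oven_program_minutes(start_temp, start_hold, steps):
    """Sum ramp and hold times of a GC oven programme.

    `steps` is a list of (target_temp_C, rate_C_per_min, hold_min) tuples,
    applied in order starting from `start_temp`.
    """
    total = start_hold
    temp = start_temp
    for target, rate, hold in steps:
        total += (target - temp) / rate + hold  # ramp time plus hold at target
        temp = target
    return total


if __name__ == "__main__":
    # 70 C (1 min) -> 200 C at 20 C/min -> 250 C at 5 C/min (5 min) -> 260 C at 5 C/min
    steps = [(200, 20, 0), (250, 5, 5), (260, 5, 0)]
    print(oven_program_minutes(70, 1, steps))
```

The individual segments are 1 min hold, 6.5 min ramp, 10 min ramp, 5 min hold and 2 min ramp, i.e. 24.5 minutes per injection before any cool-down.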

Expression analysis

Result of the expression of a Phaeodactylum tricornutum
Δ6-acyl-lipid desaturase, a Phaeodactylum tricornutum
Δ5-acyl-lipid desaturase and the delta-6-specific elongase in
tobacco seeds:

Figure 2: Fatty acid profile of transgenic tobacco seeds. The
plants were transformed with a triple expression cassette which
expresses, under the control of the USP promoter, the
delta-6-desaturase, the delta-5-desaturase and the Physcomitrella
patens elongase PpPSE1 (pARA2). 100 transgenic tobacco and
linseed plants are generated, of which approximately 20%
synthesize arachidonic acid in the seed.

Figure 3: Tobacco wild-type control.

Example 9: Purification of the desired product from transformed
organisms

The desired product can be obtained from plant material or fungi,
algae, ciliates, animal cells or from the supernatant of the
above-described cultures by various methods known in the art. If
the desired product is not excreted from the cells, the cells can
be harvested from the culture by slow centrifugation, and the
cells can be lysed by standard techniques such as mechanical
force or sonication. Plant organs can be separated mechanically
from other tissue or other organs. After homogenization, the cell
debris is removed by centrifugation, and the supernatant
fraction, which comprises the soluble proteins, is stored for the
further purification of the desired compound. If the product is
excreted from the desired cells, the cells are removed from the
culture by slow centrifugation, and the supernatant fraction is
stored for further purification.

The supernatant fraction of each purification method is subjected
to chromatography with a suitable resin, the desired molecule
either being retained on the chromatography resin, while many
contaminations in the sample are not, or else the contaminations
are retained on the resin, while the sample is not. If necessary,
these chromatography steps can be repeated, using identical or
different chromatography resins. The skilled worker is familiar
with the selection of suitable chromatography resins and their
most effective application for a particular molecule to be
purified. The purified product can be concentrated by filtration
or ultrafiltration and stored at a temperature which provides
maximum stability of the product.

A broad spectrum of purification methods is known in the art, and
the above purification method is not intended to be limiting.
These purification methods are described, for example, in
Bailey, J.E., & Ollis, D.F., Biochemical Engineering
Fundamentals, McGraw-Hill: New York (1986).

The identity and purity of the compounds which have been isolated
can be determined by standard techniques of the art. These
include high-performance liquid chromatography (HPLC),
spectroscopic methods, staining methods, thin-layer
chromatography, in particular thin-layer chromatography and flame
ionization detection (IATROSCAN, Iatron, Tokyo, Japan), NIRS,
enzyme assay or microbiological methods. For an overview of these
analytical methods, see: Patek et al. (1994) Appl. Environ.
Microbiol. 60:133-140; Malakhova et al. (1996) Biotekhnologiya
11:27-32; Schmidt et al. (1998) Bioprocess Engineer.
19:67-70; Ullmann's Encyclopedia of Industrial Chemistry (1996)
vol. A27, VCH: Weinheim, pp. 89-90, pp. 521-540, pp. 540-547,
pp. 559-566, 575-581 and pp. 581-587; Michal, G. (1999)
Biochemical Pathways: An Atlas of Biochemistry and Molecular
Biology, John Wiley and Sons; and Fallon, A., et al. (1987)
Applications of HPLC in Biochemistry in: Laboratory Techniques in
Biochemistry and Molecular Biology, vol. 17.

Equivalents

The skilled worker will recognize, or can ascertain by simply
using routine experiments, many equivalents of the specific
embodiments according to the invention described herein. These
equivalents are intended to fall within the patent claims.

SEQUENCE LISTING

<110> BASF Plant Science GmbH

<120> Process for the production of polyunsaturated
      fatty acids in plants

<130> 2.002/271

<140> 2002_271
<141> 2002-04-26

<160> 64

<170> PatentIn Vers. 2.0

<210> 1
<211> 1687
<212> DNA
<213> Borago officinalis

<220>
<221> CDS
<222> (42)..(1388)
<223> Δ6-desaturase

<400> 1

tatctgccta ccctcccaaa gagagtagtc atttttcatc a atg gct gct caa atc   56
                                              Met Ala Ala Gln Ile
                                              1               5

aag aaa tac att acc tca gat gaa ctc aag aac cac gat aaa ccc gga   104
Lys Lys Tyr Ile Thr Ser Asp Glu Leu Lys Asn His Asp Lys Pro Gly
                10                  15                  20

gat cta tgg atc tcg att caa ggg aaa gcc tat gat gtt tcg gat tgg   152
Asp Leu Trp Ile Ser Ile Gln Gly Lys Ala Tyr Asp Val Ser Asp Trp
            25                  30                  35

gtg aaa gac cat cca ggt ggc agc ttt ccc ttg aag agt ctt gct ggt   200
Val Lys Asp His Pro Gly Gly Ser Phe Pro Leu Lys Ser Leu Ala Gly
        40                  45                  50

caa gag gta act gat gca ttt gtt gca ttc cat cct gcc tct aca tgg   248
Gln Glu Val Thr Asp Ala Phe Val Ala Phe His Pro Ala Ser Thr Trp
    55                  60                  65

aag aat ctt gat aag ttt ttc act ggg tat tat ctt aaa gat tac tct   296
Lys Asn Leu Asp Lys Phe Phe Thr Gly Tyr Tyr Leu Lys Asp Tyr Ser
70                  75                  80                  85

gtt tct gag gtt tct aaa gat tat agg aag ctt gtg ttt gag ttt tct   344
Val Ser Glu Val Ser Lys Asp Tyr Arg Lys Leu Val Phe Glu Phe Ser
                90                  95                  100

aaa atg ggt ttg tat gac aaa aaa ggt cat att atg ttt gca act ttg   392
Lys Met Gly Leu Tyr Asp Lys Lys Gly His Ile Met Phe Ala Thr Leu
            105                 110                 115

tgc ttt ata gca atg ctg ttt gct atg agt gtt tat ggg gtt ttg ttt   440
Cys Phe Ile Ala Met Leu Phe Ala Met Ser Val Tyr Gly Val Leu Phe
        120                 125                 130

tgt gag ggt gtt ttg gta cat ttg ttt tct ggg tgt ttg atg ggg ttt   488
Cys Glu Gly Val Leu Val His Leu Phe Ser Gly Cys Leu Met Gly Phe
    135                 140                 145

ctt tgg att cag agt ggt tgg att gga cat gat gct ggg cat tat atg   536
Leu Trp Ile Gln Ser Gly Trp Ile Gly His Asp Ala Gly His Tyr Met
150                 155                 160                 165

gta gtg tct gat tca agg ctt aat aag ttt atg ggt att ttt gct gca   584
Val Val Ser Asp Ser Arg Leu Asn Lys Phe Met Gly Ile Phe Ala Ala
                170                 175                 180

aat tgt ctt tca gga ata agt att ggt tgg tgg aaa tgg aac cat aat   632
Asn Cys Leu Ser Gly Ile Ser Ile Gly Trp Trp Lys Trp Asn His Asn
            185                 190                 195

gca cat cac att gcc tgt aat agc ctt gaa tat gac cct gat tta caa   680
Ala His His Ile Ala Cys Asn Ser Leu Glu Tyr Asp Pro Asp Leu Gln
        200                 205                 210

tat ata cca ttc ctt gtt gtg tct tcc aag ttt ttt ggt tca ctc acc   728
Tyr Ile Pro Phe Leu Val Val Ser Ser Lys Phe Phe Gly Ser Leu Thr
    215                 220                 225

tct cat ttc tat gag aaa agg ttg act ttt gac tct tta tca aga ttc   776
Ser His Phe Tyr Glu Lys Arg Leu Thr Phe Asp Ser Leu Ser Arg Phe
230                 235                 240                 245

ttt gta agt tat caa cat tgg aca ttt tac cct att atg tgt gct gct   824
Phe Val Ser Tyr Gln His Trp Thr Phe Tyr Pro Ile Met Cys Ala Ala
                250                 255                 260

agg ctc aat atg tat gta caa tct ctc ata atg ttg ttg acc aag aga   872
Arg Leu Asn Met Tyr Val Gln Ser Leu Ile Met Leu Leu Thr Lys Arg
            265                 270                 275

aat gtg tcc tat cga gct cat gaa ctc ttg gga tgc cta gtg ttc tcg   920
Asn Val Ser Tyr Arg Ala His Glu Leu Leu Gly Cys Leu Val Phe Ser
        280                 285                 290

att tgg tac ccg ttg ctt gtt tct tgt ttg cct aat tgg ggt gaa aga   968
Ile Trp Tyr Pro Leu Leu Val Ser Cys Leu Pro Asn Trp Gly Glu Arg
    295                 300                 305

att atg ttt gtt att gca agt ttg tca gtg act gga atg caa caa gtt   1016
Ile Met Phe Val Ile Ala Ser Leu Ser Val Thr Gly Met Gln Gln Val
310                 315                 320                 325

cag ttc tcc ttg aac cac ttc tct tca agt gtt tat gtt gga aag cct   1064
Gln Phe Ser Leu Asn His Phe Ser Ser Ser Val Tyr Val Gly Lys Pro
                330                 335                 340

aaa ggg aat aat tgg ttt gag aaa caa acg gat ggg aca ctt gac att   1112
Lys Gly Asn Asn Trp Phe Glu Lys Gln Thr Asp Gly Thr Leu Asp Ile
            345                 350                 355

tct tgt cct cct tgg atg gat tgg ttt cat ggt gga ttg caa ttc caa   1160
Ser Cys Pro Pro Trp Met Asp Trp Phe His Gly Gly Leu Gln Phe Gln
        360                 365                 370

att gag cat cat ttg ttt ccc aag atg cct aga tgc aac ctt agg aaa   1208
Ile Glu His His Leu Phe Pro Lys Met Pro Arg Cys Asn Leu Arg Lys
    375                 380                 385

atc tcg ccc tac gtg atc gag tta tgc aag aaa cat aat ttg cct tac   1256
Ile Ser Pro Tyr Val Ile Glu Leu Cys Lys Lys His Asn Leu Pro Tyr
390                 395                 400                 405

aat tat gca tct ttc tcc aag gcc aat gaa atg aca ctc aga aca ttg   1304
Asn Tyr Ala Ser Phe Ser Lys Ala Asn Glu Met Thr Leu Arg Thr Leu
                410                 415                 420

agg aac aca gca ttg cag gct agg gat ata acc aag ccg ctc ccg aag   1352
Arg Asn Thr Ala Leu Gln Ala Arg Asp Ile Thr Lys Pro Leu Pro Lys
            425                 430                 435

aat ttg gta tgg gaa gct ctt cac act cat ggt taa aattaccctt        1398
Asn Leu Val Trp Glu Ala Leu His Thr His Gly
        440                 445

agttcatgta ataatttgag attatgtatc tcctatgttt gtgtcttgtc ttggttctac 1458

ttgttggagt cattgcaact tgtcttttat ggtttattag atgtttttta atatatttta 1518

gaggttttgc tttcatctcc attattgatg aataaggagt tgcatattgt caattgttgt 1578

gctcaatatc tgatattttg gaatgtactt tgtaccactg tgttttcagt tgaagctcat 1638

gtgtacttct atagactttg tttaaatggt tatgaaaaaa aaaaaaaaa             1687
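
The CDS coordinates given under <222> and the protein lengths given under <211> of the following PRT entries can be cross-checked with simple arithmetic: the number of codons in the CDS, minus the stop codon, should equal the protein length. The helper below is a hypothetical sketch for that check (the function name is an assumption); the coordinates are those stated for SEQ ID NO: 1, 3 and 5 in this listing.

```python
def cds_protein_length(start, end):
    """Number of amino acids encoded by a CDS spanning `start`..`end` (1-based,
    inclusive), assuming the last codon of the span is the stop codon."""
    n_bases = end - start + 1
    if n_bases % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return n_bases // 3 - 1  # subtract the stop codon


if __name__ == "__main__":
    # (SEQ ID NO, CDS start, CDS end, length stated for the matching PRT entry)
    entries = [(1, 42, 1388, 448), (3, 58, 930, 290), (5, 43, 858, 271)]
    for seq_id, start, end, stated in entries:
        print(seq_id, cds_protein_length(start, end), stated)
```

For all three entries the computed and stated lengths agree, which supports reading the garbled coordinate lines as (42)..(1388), (58)..(930) and (43)..(858).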

<210> 2
<211> 448
<212> PRT
<213> Borago officinalis

<400> 2

Met Ala Ala Gln Ile Lys Lys Tyr Ile Thr Ser Asp Glu Leu Lys Asn
1               5                   10                  15

His Asp Lys Pro Gly Asp Leu Trp Ile Ser Ile Gln Gly Lys Ala Tyr
            20                  25                  30

Asp Val Ser Asp Trp Val Lys Asp His Pro Gly Gly Ser Phe Pro Leu
        35                  40                  45

Lys Ser Leu Ala Gly Gln Glu Val Thr Asp Ala Phe Val Ala Phe His
    50                  55                  60

Pro Ala Ser Thr Trp Lys Asn Leu Asp Lys Phe Phe Thr Gly Tyr Tyr
65                  70                  75                  80

Leu Lys Asp Tyr Ser Val Ser Glu Val Ser Lys Asp Tyr Arg Lys Leu
                85                  90                  95

Val Phe Glu Phe Ser Lys Met Gly Leu Tyr Asp Lys Lys Gly His Ile
            100                 105                 110

Met Phe Ala Thr Leu Cys Phe Ile Ala Met Leu Phe Ala Met Ser Val
        115                 120                 125

Tyr Gly Val Leu Phe Cys Glu Gly Val Leu Val His Leu Phe Ser Gly
    130                 135                 140

Cys Leu Met Gly Phe Leu Trp Ile Gln Ser Gly Trp Ile Gly His Asp
145                 150                 155                 160

Ala Gly His Tyr Met Val Val Ser Asp Ser Arg Leu Asn Lys Phe Met
                165                 170                 175

Gly Ile Phe Ala Ala Asn Cys Leu Ser Gly Ile Ser Ile Gly Trp Trp
            180                 185                 190

Lys Trp Asn His Asn Ala His His Ile Ala Cys Asn Ser Leu Glu Tyr
        195                 200                 205

Asp Pro Asp Leu Gln Tyr Ile Pro Phe Leu Val Val Ser Ser Lys Phe
    210                 215                 220

Phe Gly Ser Leu Thr Ser His Phe Tyr Glu Lys Arg Leu Thr Phe Asp
225                 230                 235                 240

Ser Leu Ser Arg Phe Phe Val Ser Tyr Gln His Trp Thr Phe Tyr Pro
                245                 250                 255

Ile Met Cys Ala Ala Arg Leu Asn Met Tyr Val Gln Ser Leu Ile Met
            260                 265                 270

Leu Leu Thr Lys Arg Asn Val Ser Tyr Arg Ala His Glu Leu Leu Gly
        275                 280                 285

Cys Leu Val Phe Ser Ile Trp Tyr Pro Leu Leu Val Ser Cys Leu Pro
    290                 295                 300

Asn Trp Gly Glu Arg Ile Met Phe Val Ile Ala Ser Leu Ser Val Thr
305                 310                 315                 320

Gly Met Gln Gln Val Gln Phe Ser Leu Asn His Phe Ser Ser Ser Val
                325                 330                 335

Tyr Val Gly Lys Pro Lys Gly Asn Asn Trp Phe Glu Lys Gln Thr Asp
            340                 345                 350

Gly Thr Leu Asp Ile Ser Cys Pro Pro Trp Met Asp Trp Phe His Gly
        355                 360                 365

Gly Leu Gln Phe Gln Ile Glu His His Leu Phe Pro Lys Met Pro Arg
    370                 375                 380

Cys Asn Leu Arg Lys Ile Ser Pro Tyr Val Ile Glu Leu Cys Lys Lys
385                 390                 395                 400

His Asn Leu Pro Tyr Asn Tyr Ala Ser Phe Ser Lys Ala Asn Glu Met
                405                 410                 415

Thr Leu Arg Thr Leu Arg Asn Thr Ala Leu Gln Ala Arg Asp Ile Thr
            420                 425                 430

Lys Pro Leu Pro Lys Asn Leu Val Trp Glu Ala Leu His Thr His Gly
        435                 440                 445



<210> 3
<211> 1192
<212> DNA
<213> Physcomitrella patens

<220>
<221> CDS
<222> (58)..(930)
<223> Δ6-elongase

<400> 3

ctgcttcgtc tcatcttggg ggtgtgattc gggagtgggt tgagttggtg gagcgca    57

atg gag gtc gtg gag aga ttc tac ggt gag ttg gat ggg aag gtc tcg   105
Met Glu Val Val Glu Arg Phe Tyr Gly Glu Leu Asp Gly Lys Val Ser
1               5                   10                  15

cag ggc gtg aat gca ttg ctg ggt agt ttt ggg gtg gag ttg acg gat   153
Gln Gly Val Asn Ala Leu Leu Gly Ser Phe Gly Val Glu Leu Thr Asp
            20                  25                  30

acg ccc act acc aaa ggc ttg ccc ctc gtt gac agt ccc aca ccc atc   201
Thr Pro Thr Thr Lys Gly Leu Pro Leu Val Asp Ser Pro Thr Pro Ile
        35                  40                  45

gtc ctc ggt gtt tct gta tac ttg act att gtc att gga ggg ctt ttg   249
Val Leu Gly Val Ser Val Tyr Leu Thr Ile Val Ile Gly Gly Leu Leu
    50                  55                  60

tgg ata aag gcc agg gat ctg aaa ccg cgc gcc tcg gag cca ttt ttg   297
Trp Ile Lys Ala Arg Asp Leu Lys Pro Arg Ala Ser Glu Pro Phe Leu
65                  70                  75                  80

ctc caa gct ttg gtg ctt gtg cac aac ctg ttc tgt ttt gcg ctc agt   345
Leu Gln Ala Leu Val Leu Val His Asn Leu Phe Cys Phe Ala Leu Ser
                85                  90                  95

ctg tat atg tgc gtg ggc atc gct tat cag gct att acc tgg cgg tac   393
Leu Tyr Met Cys Val Gly Ile Ala Tyr Gln Ala Ile Thr Trp Arg Tyr
            100                 105                 110

tct ctc tgg ggc aat gca tac aat cct aaa cat aaa gag atg gcg att   441
Ser Leu Trp Gly Asn Ala Tyr Asn Pro Lys His Lys Glu Met Ala Ile
        115                 120                 125

ctg gta tac ttg ttc tac atg tct aag tac gtg gaa ttc atg gat acc   489
Leu Val Tyr Leu Phe Tyr Met Ser Lys Tyr Val Glu Phe Met Asp Thr
    130                 135                 140

gtt atc atg ata ctg aag cgc agc acc agg caa ata agc ttc ctc cac   537
Val Ile Met Ile Leu Lys Arg Ser Thr Arg Gln Ile Ser Phe Leu His
145                 150                 155                 160

gtt tat cat cat tct tca att tcc ctc att tgg tgg gct att gct cat   585
Val Tyr His His Ser Ser Ile Ser Leu Ile Trp Trp Ala Ile Ala His
                165                 170                 175

cac gct cct ggc ggt gaa gca tat tgg tct gcg gct ctg aac tca gga   633
His Ala Pro Gly Gly Glu Ala Tyr Trp Ser Ala Ala Leu Asn Ser Gly
            180                 185                 190

gtg cat gtt ctc atg tat gcg tat tac ttc ttg gct gcc tgc ctt cga   681
Val His Val Leu Met Tyr Ala Tyr Tyr Phe Leu Ala Ala Cys Leu Arg
        195                 200                 205

agt agc cca aag tta aaa aat aag tac ctt ttt tgg ggc agg tac ttg   729
Ser Ser Pro Lys Leu Lys Asn Lys Tyr Leu Phe Trp Gly Arg Tyr Leu
    210                 215                 220

aca caa ttc caa atg ttc cag ttt atg ctg aac tta gtg cag gct tac   777
Thr Gln Phe Gln Met Phe Gln Phe Met Leu Asn Leu Val Gln Ala Tyr
225                 230                 235                 240

tac gac atg aaa acg aat gcg cca tat cca caa tgg ctg atc aag att   825
Tyr Asp Met Lys Thr Asn Ala Pro Tyr Pro Gln Trp Leu Ile Lys Ile
                245                 250                 255

ttg ttc tac tac atg atc tcg ttg ctg ttt ctt ttc ggc aat ttt tac   873
Leu Phe Tyr Tyr Met Ile Ser Leu Leu Phe Leu Phe Gly Asn Phe Tyr
            260                 265                 270

gta caa aaa tac atc aaa ccc tct gac gga aag caa aag gga gct aaa   921
Val Gln Lys Tyr Ile Lys Pro Ser Asp Gly Lys Gln Lys Gly Ala Lys
        275                 280                 285

act gag tga gctgtatcaa gccatagaaa ctctattatg ttagaacctg           970
Thr Glu
    290

aagttggtgc tttcttatct ccacttatct tttaagcagc atcagttttg aaatgatgtg 1030

tgggcgtggt ctgcaagtag tcatcaatat aatcggcctg agcacttcag atggattgtt 1090

agaacatgag taaaagcggt tattacggtg tttattttgt accaaatcac cgcacgggtg 1150

aattgaaata tttcagattt gatcaatttc atctgaaaaa aa                    1192



<210> 4
<211> 290
<212> PRT
<213> Physcomitrella patens

<400> 4

Met Glu Val Val Glu Arg Phe Tyr Gly Glu Leu Asp Gly Lys Val Ser
1               5                   10                  15

Gln Gly Val Asn Ala Leu Leu Gly Ser Phe Gly Val Glu Leu Thr Asp
            20                  25                  30

Thr Pro Thr Thr Lys Gly Leu Pro Leu Val Asp Ser Pro Thr Pro Ile
        35                  40                  45

Val Leu Gly Val Ser Val Tyr Leu Thr Ile Val Ile Gly Gly Leu Leu
    50                  55                  60

Trp Ile Lys Ala Arg Asp Leu Lys Pro Arg Ala Ser Glu Pro Phe Leu
65                  70                  75                  80

Leu Gln Ala Leu Val Leu Val His Asn Leu Phe Cys Phe Ala Leu Ser
                85                  90                  95

Leu Tyr Met Cys Val Gly Ile Ala Tyr Gln Ala Ile Thr Trp Arg Tyr
            100                 105                 110

Ser Leu Trp Gly Asn Ala Tyr Asn Pro Lys His Lys Glu Met Ala Ile
        115                 120                 125

Leu Val Tyr Leu Phe Tyr Met Ser Lys Tyr Val Glu Phe Met Asp Thr
    130                 135                 140

Val Ile Met Ile Leu Lys Arg Ser Thr Arg Gln Ile Ser Phe Leu His
145                 150                 155                 160

Val Tyr His His Ser Ser Ile Ser Leu Ile Trp Trp Ala Ile Ala His
                165                 170                 175

His Ala Pro Gly Gly Glu Ala Tyr Trp Ser Ala Ala Leu Asn Ser Gly
            180                 185                 190

Val His Val Leu Met Tyr Ala Tyr Tyr Phe Leu Ala Ala Cys Leu Arg
        195                 200                 205

Ser Ser Pro Lys Leu Lys Asn Lys Tyr Leu Phe Trp Gly Arg Tyr Leu
    210                 215                 220

Thr Gln Phe Gln Met Phe Gln Phe Met Leu Asn Leu Val Gln Ala Tyr
225                 230                 235                 240

Tyr Asp Met Lys Thr Asn Ala Pro Tyr Pro Gln Trp Leu Ile Lys Ile
                245                 250                 255

Leu Phe Tyr Tyr Met Ile Ser Leu Leu Phe Leu Phe Gly Asn Phe Tyr
            260                 265                 270

Val Gln Lys Tyr Ile Lys Pro Ser Asp Gly Lys Gln Lys Gly Ala Lys
        275                 280                 285

Thr Glu
    290



<210> 5 
<211> 1054 
<212> DNA 

<213> Thraustochytrium 

<220> 

<221> CDS 

<222> (43). -(858) 

<223> A6-elangase 

■ 

<400> 5 

gaattcggca cgagagcgcg cggagcggag acctcggccg eg atg atg gag ccg 54 

Met Met Glu Pro 
1 

etc gac agg tac agg gcg ctg gcg gag etc gec gcg agg tac gee age 102 
Leu Asp Arg Tyr Arg Ala Leu Ala Glu Leu Ala Ala Arg Tyr Ala Ser 
5 10 15 20 

teg gcg gee ttc aag tgg caa gtc acg tac gac gee aag gac age ttc 150 
Ser Ala Ala Phe Lys Trp Gin Val Thr Tyr Asp Ala Lys Asp Ser Phe 

25 30 35 

gtc ggg ccc ctg gga ate egg gag ccg etc ggg etc ctg gtg ggc tec 198 
Val Gly Pro Leu Gly lie Arg Glu Pro Leu Gly Leu Leu Val Gly Ser 

40 45 50 

gtg gtc etc tac ctg age ctg ctg gee gtg gtc tac gcg ctg egg aac 2 46 
Val Val Leu Tyr Leu Ser Leu Leu Ala Val Val Tyr Ala Leu Arg Asn 
55 60 65 

tac ctt ggc ggc etc atg gcg etc cgc age gtg cat aac etc ggg etc 294 
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Tyr Leu Gly Gly Leu Met Ala Leu Arg Ser Val His Asn Leu Gly Leu 
70 75 80 

tgc etc ttc teg ggc gec gtg tgg ate tac acg age tac etc atg ate 342 
Cys Leu Phe Ser Gly Ala Val Trp lie Tyr Thr Ser Tyr Leu Met He 
85 90 95 100 



cag gat ggg cac ttt cgc agc ctc gag gcg gca acg tgc gag ccg ctc 390
Gln Asp Gly His Phe Arg Ser Leu Glu Ala Ala Thr Cys Glu Pro Leu

105 110 115 

aag cat ccg cac ttc cag ctc atc agc ttg ctc ttt gcg ctg tcc aag 438
Lys His Pro His Phe Gln Leu Ile Ser Leu Leu Phe Ala Leu Ser Lys

120 125 130 

atc tgg gag tgg ttc gac acg gtg ctc ctc atc gtc aag ggc aac aag 486
Ile Trp Glu Trp Phe Asp Thr Val Leu Leu Ile Val Lys Gly Asn Lys
135 140 145 



ctc cgc ttc ctg cac gtc ttg cac cac gcc acg acc ttt tgg ctc tac 534
Leu Arg Phe Leu His Val Leu His His Ala Thr Thr Phe Trp Leu Tyr
150 155 160

gcc atc gac cac atc ttt ctc tcg tcc atc aag tac ggc gtc gcg gtc 582
Ala Ile Asp His Ile Phe Leu Ser Ser Ile Lys Tyr Gly Val Ala Val
165 170 175 180

aat gct ttc atc cac acc gtc atg tac gcg cac tac ttc cgc cca ttc 630
Asn Ala Phe Ile His Thr Val Met Tyr Ala His Tyr Phe Arg Pro Phe
185 190 195

ccg aag ggc ttg cgc ccg ctt att acg cag ttg cag atc gtc cag ttc 678
Pro Lys Gly Leu Arg Pro Leu Ile Thr Gln Leu Gln Ile Val Gln Phe
200 205 210



att ttc agc atc ggc atc cat acc gcc att tac tgg cac tac gac tgc 726
Ile Phe Ser Ile Gly Ile His Thr Ala Ile Tyr Trp His Tyr Asp Cys
215 220 225 



gag ccg ctc gtg cat acc cac ttt tgg gaa tac gtc acg ccc tac ctt 774
Glu Pro Leu Val His Thr His Phe Trp Glu Tyr Val Thr Pro Tyr Leu
230 235 240 

ttc gtc gtg ccc ttc ctc atc ctc ttt ttc aat ttt tac ctg cag cag 822
Phe Val Val Pro Phe Leu Ile Leu Phe Phe Asn Phe Tyr Leu Gln Gln
245 250 255 260 

tac gtc ctc gcg ccc gca aaa acc aag aag gca tag ccacgtaaca 868
Tyr Val Leu Ala Pro Ala Lys Thr Lys Lys Ala 

265 270 

gtagaccagc agcgccgagg acgcgtgccg cgttatcgcg aagcacgaaa taaagaagat 928



catttgattc aacgaggcta cttgcggcca cgagaaaaaa aaaaaaaaaa aaaaaaaaaa 988

aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 1048

ctcgag 1054



<210> 6 
<211> 271 
<212> PRT 

<213> Thraustochytrium 



<400> 6 

Met Met Glu Pro Leu Asp Arg Tyr Arg Ala Leu Ala Glu Leu Ala Ala
1 5 10 15

Arg Tyr Ala Ser Ser Ala Ala Phe Lys Trp Gln Val Thr Tyr Asp Ala
20 25 30

Lys Asp Ser Phe Val Gly Pro Leu Gly Ile Arg Glu Pro Leu Gly Leu
35 40 45

Leu Val Gly Ser Val Val Leu Tyr Leu Ser Leu Leu Ala Val Val Tyr
50 55 60

Ala Leu Arg Asn Tyr Leu Gly Gly Leu Met Ala Leu Arg Ser Val His
65 70 75 80

Asn Leu Gly Leu Cys Leu Phe Ser Gly Ala Val Trp Ile Tyr Thr Ser
85 90 95

Tyr Leu Met Ile Gln Asp Gly His Phe Arg Ser Leu Glu Ala Ala Thr
100 105 110

Cys Glu Pro Leu Lys His Pro His Phe Gln Leu Ile Ser Leu Leu Phe
115 120 125

Ala Leu Ser Lys Ile Trp Glu Trp Phe Asp Thr Val Leu Leu Ile Val
130 135 140

Lys Gly Asn Lys Leu Arg Phe Leu His Val Leu His His Ala Thr Thr
145 150 155 160

Phe Trp Leu Tyr Ala Ile Asp His Ile Phe Leu Ser Ser Ile Lys Tyr
165 170 175

Gly Val Ala Val Asn Ala Phe Ile His Thr Val Met Tyr Ala His Tyr
180 185 190

Phe Arg Pro Phe Pro Lys Gly Leu Arg Pro Leu Ile Thr Gln Leu Gln
195 200 205

Ile Val Gln Phe Ile Phe Ser Ile Gly Ile His Thr Ala Ile Tyr Trp
210 215 220

His Tyr Asp Cys Glu Pro Leu Val His Thr His Phe Trp Glu Tyr Val
225 230 235 240

Thr Pro Tyr Leu Phe Val Val Pro Phe Leu Ile Leu Phe Phe Asn Phe
245 250 255

Tyr Leu Gln Gln Tyr Val Leu Ala Pro Ala Lys Thr Lys Lys Ala
260 265 270



<210> 7 
<211> 2040
<212> DNA

<213> Ceratodon purpureus 

<220> 
<221> CDS 

<222> (176)..(1627)
<223> Δ6-desaturase

<400> 7 

ctcaggcagg tctcagttga tgagacgctg agttctgaat cctttgagct gtgtcaggct 60 

cggcacttgt gggatggtga aggagtgatc gatcaggagt gcaggagctg cattagtttc 120 

tcagggtcga tcaggttatt ctgaaaaagg ctgcgtctgt gagcagtttg caaaa atg 178

Met 



gcc ctc gtt acc gac ttt ctg aac ttt ctg ggc acg aca tgg agc aag 226
Ala Leu Val Thr Asp Phe Leu Asn Phe Leu Gly Thr Thr Trp Ser Lys 

5 10 15 

tac agc gtg tac acc cat agc tat gct gga aac tat ggg cct act ttg 274
Tyr Ser Val Tyr Thr His Ser Tyr Ala Gly Asn Tyr Gly Pro Thr Leu 
20 25 30 

aag cac gcc aaa aag gtt tct gct caa ggt aaa act gcg gga cag aca 322
Lys His Ala Lys Lys Val Ser Ala Gln Gly Lys Thr Ala Gly Gln Thr
35 40 45 

ctg aga cag aga tcg gtg cag gac aaa aag cca ggc act tac tct ctg 370
Leu Arg Gln Arg Ser Val Gln Asp Lys Lys Pro Gly Thr Tyr Ser Leu

50 55 60 65 

gcc gat gtt gct tct cac gac agg cct gga gac tgc tgg atg atc gtc 418
Ala Asp Val Ala Ser His Asp Arg Pro Gly Asp Cys Trp Met Ile Val

70 75 80 



aaa gag aag gtg tat gat att agc cgt ttt gcg gac gac cac cct gga 466
Lys Glu Lys Val Tyr Asp Ile Ser Arg Phe Ala Asp Asp His Pro Gly
85 90 95

ggg acg gta att agc acc tac ttt ggg cgg gat ggc aca gac gtt ttc 514
Gly Thr Val Ile Ser Thr Tyr Phe Gly Arg Asp Gly Thr Asp Val Phe
100 105 110

gca aca ttc cat cca cct gcc gca tgg aag caa ctc aat gac tac tac 562
Ala Thr Phe His Pro Pro Ala Ala Trp Lys Gln Leu Asn Asp Tyr Tyr
115 120 125

att gga gac ctt gct agg gaa gag ccc ctt gat gaa ttg ctt aaa gac 610
Ile Gly Asp Leu Ala Arg Glu Glu Pro Leu Asp Glu Leu Leu Lys Asp
130 135 140 145

tac aga gat atg aga gcc gag ttt gtt aga gaa ggg ctt ttc aag agt 658
Tyr Arg Asp Met Arg Ala Glu Phe Val Arg Glu Gly Leu Phe Lys Ser
150 155 160

tcc aag gcc tgg ttc ctg ctt cag act ctg att aat gca gct ctc ttt 706
Ser Lys Ala Trp Phe Leu Leu Gln Thr Leu Ile Asn Ala Ala Leu Phe
165 170 175

gct gcg agc att gcg act atc tgt tac gac aag agt tac tgg gct att 754
Ala Ala Ser Ile Ala Thr Ile Cys Tyr Asp Lys Ser Tyr Trp Ala Ile
180 185 190

gtg ctg tca gcc agt ttg atg ggt ctc ttc gtc caa cag tgt gga tgg 802
Val Leu Ser Ala Ser Leu Met Gly Leu Phe Val Gln Gln Cys Gly Trp
195 200 205

ctt gcc cat gat ttc ctt cat caa cag gtc ttt gag aac cgt acc gcg 850
Leu Ala His Asp Phe Leu His Gln Gln Val Phe Glu Asn Arg Thr Ala
210 215 220 225

aac tcc ttc ttt ggc tat ttg ttc ggc aat tgc gtg ctt ggc ttt agt 898
Asn Ser Phe Phe Gly Tyr Leu Phe Gly Asn Cys Val Leu Gly Phe Ser
230 235 240

gta tca tgg tgg agg acg aag cac aac att cat cat act gct ccg aat 946
Val Ser Trp Trp Arg Thr Lys His Asn Ile His His Thr Ala Pro Asn
245 250 255

gag tgc gac gaa cag tac aca cct cta gac gaa gac att gat act ctc 994
Glu Cys Asp Glu Gln Tyr Thr Pro Leu Asp Glu Asp Ile Asp Thr Leu

260 265 270 

ccc atc att gcc tgg agc aag gaa att ttg gcc acc gtt gag agc aag 1042
Pro Ile Ile Ala Trp Ser Lys Glu Ile Leu Ala Thr Val Glu Ser Lys
275 280 285 

aga att ttg cga gtg ctt caa tat cag cac tac atg att ctg cct cta 1090
Arg Ile Leu Arg Val Leu Gln Tyr Gln His Tyr Met Ile Leu Pro Leu
290 295 300 305 

ttg ttc atg gcc cgg tac agt tgg act ttt gga agt ttg ctc ttc aca 1138

Leu Phe Met Ala Arg Tyr Ser Trp Thr Phe Gly Ser Leu Leu Phe Thr 

310 315 320 

ttc aat cct gat ttg agc acg acc aag gga ttg ata gag aag gga aca 1186
Phe Asn Pro Asp Leu Ser Thr Thr Lys Gly Leu Ile Glu Lys Gly Thr

325 330 335 

gtt gct ttt cac tac gcc tgg ttc agt tgg gct gcg ttc cat att ttg 1234
Val Ala Phe His Tyr Ala Trp Phe Ser Trp Ala Ala Phe His Ile Leu
340 345 350

ccg ggt gtc gct aag cct ctt gcg tgg atg gta gca act gag ctt gtg 1282

Pro Gly Val Ala Lys Pro Leu Ala Trp Met Val Ala Thr Glu Leu Val 
355 360 365 

gcc ggt ttg ttg ttg gga ttc gtg ttt acg ttg agt cac aat gga aag 1330

Ala Gly Leu Leu Leu Gly Phe Val Phe Thr Leu Ser His Asn Gly Lys 
370 375 380 385 

gag gtt tac aat gaa tcg aag gac ttc gtg aga gcc cag gtt att acc 1378
Glu Val Tyr Asn Glu Ser Lys Asp Phe Val Arg Ala Gln Val Ile Thr

390 395 400 

acc cgt aac acc aag cga ggc tgg ttc aac gat tgg ttc act ggg gga 1426
Thr Arg Asn Thr Lys Arg Gly Trp Phe Asn Asp Trp Phe Thr Gly Gly
405 410 415

ctc gac acc cag att gag cat cac ctg ttt cca aca atg ccc agg cac 1474
Leu Asp Thr Gln Ile Glu His His Leu Phe Pro Thr Met Pro Arg His
420 425 430 

aac tac ccc aag atc gca cct cag gtc gag gct ctt tgc aag aag cac 1522
Asn Tyr Pro Lys Ile Ala Pro Gln Val Glu Ala Leu Cys Lys Lys His
435 440 445 

ggc ctc gag tac gat aat gtc tcc gtc gtt ggt gcc tct gtc gcg gtt 1570
Gly Leu Glu Tyr Asp Asn Val Ser Val Val Gly Ala Ser Val Ala Val 
450 455 460 465 

gtg aag gcg ctc aag gaa att gct gat gaa gcg tca att cgg ctt cac 1618
Val Lys Ala Leu Lys Glu Ile Ala Asp Glu Ala Ser Ile Arg Leu His

470 475 480 


gct cac taa gaaatcgtcg aactttgact attcattttt ttcgcctggc 1667
Ala His 

tacctcaaat gttcgggagc aggtgcttgg cagtgtgttc aaccggagcg cactgaaaat 1727
gtgcagaatc catttccaga aattaccatt cctagctaaa tcttcttttt accaggtcgg 1787
atatatgaaa cttttttgat gcaacaagta gcattcaatt gaagacattg ttcgagatat 1847
aattcgcagt gtttctattc agcgggcata cgtactagtc catatcggcg gttgccgaga 1907
gtttacatta ttagttggca caacgagtag atctagtgta aatttctatt tccgcatgta 1967 
atattactct gaatatatac cgttatctat tttcctaaaa aaaaaaaaaa aaaaaaaaaa 2027 
aaaaaaaaaa aaa 2040 



<210> 8 
<211> 483 
<212> PRT 

<213> Ceratodon purpureus
<400> 8

Met Ala Leu Val Thr Asp Phe Leu Asn Phe Leu Gly Thr Thr Trp Ser
1 5 10 15

Lys Tyr Ser Val Tyr Thr His Ser Tyr Ala Gly Asn Tyr Gly Pro Thr
20 25 30

Leu Lys His Ala Lys Lys Val Ser Ala Gln Gly Lys Thr Ala Gly Gln
35 40 45

Thr Leu Arg Gln Arg Ser Val Gln Asp Lys Lys Pro Gly Thr Tyr Ser
50 55 60

Leu Ala Asp Val Ala Ser His Asp Arg Pro Gly Asp Cys Trp Met Ile
65 70 75 80

Val Lys Glu Lys Val Tyr Asp Ile Ser Arg Phe Ala Asp Asp His Pro
85 90 95

Gly Gly Thr Val Ile Ser Thr Tyr Phe Gly Arg Asp Gly Thr Asp Val
100 105 110

Phe Ala Thr Phe His Pro Pro Ala Ala Trp Lys Gln Leu Asn Asp Tyr
115 120 125

Tyr Ile Gly Asp Leu Ala Arg Glu Glu Pro Leu Asp Glu Leu Leu Lys
130 135 140

Asp Tyr Arg Asp Met Arg Ala Glu Phe Val Arg Glu Gly Leu Phe Lys
145 150 155 160

Ser Ser Lys Ala Trp Phe Leu Leu Gln Thr Leu Ile Asn Ala Ala Leu
165 170 175

Phe Ala Ala Ser Ile Ala Thr Ile Cys Tyr Asp Lys Ser Tyr Trp Ala
180 185 190

Ile Val Leu Ser Ala Ser Leu Met Gly Leu Phe Val Gln Gln Cys Gly
195 200 205

Trp Leu Ala His Asp Phe Leu His Gln Gln Val Phe Glu Asn Arg Thr
210 215 220

Ala Asn Ser Phe Phe Gly Tyr Leu Phe Gly Asn Cys Val Leu Gly Phe
225 230 235 240

Ser Val Ser Trp Trp Arg Thr Lys His Asn Ile His His Thr Ala Pro
245 250 255

Asn Glu Cys Asp Glu Gln Tyr Thr Pro Leu Asp Glu Asp Ile Asp Thr
260 265 270

Leu Pro Ile Ile Ala Trp Ser Lys Glu Ile Leu Ala Thr Val Glu Ser
275 280 285

Lys Arg Ile Leu Arg Val Leu Gln Tyr Gln His Tyr Met Ile Leu Pro
290 295 300

Leu Leu Phe Met Ala Arg Tyr Ser Trp Thr Phe Gly Ser Leu Leu Phe
305 310 315 320

Thr Phe Asn Pro Asp Leu Ser Thr Thr Lys Gly Leu Ile Glu Lys Gly
325 330 335

Thr Val Ala Phe His Tyr Ala Trp Phe Ser Trp Ala Ala Phe His Ile
340 345 350

Leu Pro Gly Val Ala Lys Pro Leu Ala Trp Met Val Ala Thr Glu Leu
355 360 365

Val Ala Gly Leu Leu Leu Gly Phe Val Phe Thr Leu Ser His Asn Gly
370 375 380

Lys Glu Val Tyr Asn Glu Ser Lys Asp Phe Val Arg Ala Gln Val Ile
385 390 395 400

Thr Thr Arg Asn Thr Lys Arg Gly Trp Phe Asn Asp Trp Phe Thr Gly
405 410 415

Gly Leu Asp Thr Gln Ile Glu His His Leu Phe Pro Thr Met Pro Arg
420 425 430

His Asn Tyr Pro Lys Ile Ala Pro Gln Val Glu Ala Leu Cys Lys Lys
435 440 445

His Gly Leu Glu Tyr Asp Asn Val Ser Val Val Gly Ala Ser Val Ala
450 455 460

Val Val Lys Ala Leu Lys Glu Ile Ala Asp Glu Ala Ser Ile Arg Leu
465 470 475 480

His Ala His



<210> 9
<211> 1467 
<212> DNA 

<213> Ceratodon purpureus 

<220> 
<221> CDS 

<222> (10)..(1461)
<223> Δ6-desaturase

<400> 9 

ggatccaaa atg gcc ctc gtt acc gac ttt ctg aac ttt ctg ggc acg aca 51
Met Ala Leu Val Thr Asp Phe Leu Asn Phe Leu Gly Thr Thr
1 5 10

tgg agc aag tac agc gtg tac acc cat agc tat gct gga aac tat ggg 99
Trp Ser Lys Tyr Ser Val Tyr Thr His Ser Tyr Ala Gly Asn Tyr Gly
15 20 25 30



cct act ttg aag cac gcc aaa aag gtt tct gct caa ggt aaa act gcg 147
Pro Thr Leu Lys His Ala Lys Lys Val Ser Ala Gln Gly Lys Thr Ala

35 40 45 

gga cag aca ctg aga cag aga tcg gtg cag gac aaa aag cca ggc act 195
Gly Gln Thr Leu Arg Gln Arg Ser Val Gln Asp Lys Lys Pro Gly Thr
50 55 60

tac tct ctg gcc gat gtt gct tct cac gac agg cct gga gac tgc tgg 243
Tyr Ser Leu Ala Asp Val Ala Ser His Asp Arg Pro Gly Asp Cys Trp

65 70 75 

atg atc gtc aaa gag aag gtg tat gat att agc cgt ttt gcg gac gac 291
Met Ile Val Lys Glu Lys Val Tyr Asp Ile Ser Arg Phe Ala Asp Asp
80 85 90 

cac cct gga ggg acg gta att agc acc tac ttt ggg cgg gat ggc aca 339
His Pro Gly Gly Thr Val Ile Ser Thr Tyr Phe Gly Arg Asp Gly Thr
95 100 105 110

gac gtt ttc gca aca ttc cat cca cct gcc gca tgg aag caa ctc aat 387
Asp Val Phe Ala Thr Phe His Pro Pro Ala Ala Trp Lys Gln Leu Asn

115 120 125 

gac tac tac att gga gac ctt gct agg gaa gag ccc ctt gat gaa ttg 435
Asp Tyr Tyr Ile Gly Asp Leu Ala Arg Glu Glu Pro Leu Asp Glu Leu

130 135 140 

ctt aaa gac tac aga gat atg aga gcc gag ttt gtt aga gaa ggg ctt 483 
Leu Lys Asp Tyr Arg Asp Met Arg Ala Glu Phe Val Arg Glu Gly Leu 
145 150 155

ttc aag agt tcc aag gcc tgg ttc ctg ctt cag act ctg att aat gca 531
Phe Lys Ser Ser Lys Ala Trp Phe Leu Leu Gln Thr Leu Ile Asn Ala
160 165 170 

gct ctc ttt gct gcg agc att gcg act atc tgt tac gac aag agt tac 579
Ala Leu Phe Ala Ala Ser Ile Ala Thr Ile Cys Tyr Asp Lys Ser Tyr
175 180 185 190 

tgg gct att gtg ctg tca gcc agt ttg atg ggt ctc ttc gtc caa cag 627
Trp Ala Ile Val Leu Ser Ala Ser Leu Met Gly Leu Phe Val Gln Gln

195 200 205 

tgt gga tgg ctt gcc cat gat ttc ctt cat caa cag gtc ttt gag aac 675
Cys Gly Trp Leu Ala His Asp Phe Leu His Gln Gln Val Phe Glu Asn
210 215 220

cgt acc gcg aac tcc ttc ttt ggc tat ttg ttc ggc aat tgc gtg ctt 723
Arg Thr Ala Asn Ser Phe Phe Gly Tyr Leu Phe Gly Asn Cys Val Leu 
225 230 235 

ggc ttt agt gta tca tgg tgg agg acg aag cac aac att cat cat act 771
Gly Phe Ser Val Ser Trp Trp Arg Thr Lys His Asn Ile His His Thr
240 245 250 

gct ccg aat gag tgc gac gaa cag tac aca cct cta gac gaa gac att 819
Ala Pro Asn Glu Cys Asp Glu Gln Tyr Thr Pro Leu Asp Glu Asp Ile
255 260 265 270

gat act ctc ccc atc att gcc tgg agc aag gaa att ttg gcc acc gtt 867
Asp Thr Leu Pro Ile Ile Ala Trp Ser Lys Glu Ile Leu Ala Thr Val

275 280 285 

gag agc aag aga att ttg cga gtg ctt caa tat cag cac tac atg att 915
Glu Ser Lys Arg Ile Leu Arg Val Leu Gln Tyr Gln His Tyr Met Ile

290 295 300 

ctg cct cta ttg ttc atg gcc cgg tac agt tgg act ttt gga agt ttg 963
Leu Pro Leu Leu Phe Met Ala Arg Tyr Ser Trp Thr Phe Gly Ser Leu
305 310 315

ctc ttc aca ttc aat cct gat ttg agc acg acc aag gga ttg ata gag 1011
Leu Phe Thr Phe Asn Pro Asp Leu Ser Thr Thr Lys Gly Leu Ile Glu
320 325 330 

aag gga aca gtt gct ttt cac tac gcc tgg ttc agt tgg gct gcg ttc 1059
Lys Gly Thr Val Ala Phe His Tyr Ala Trp Phe Ser Trp Ala Ala Phe 
335 340 345 350 

cat att ttg ccg ggt gtc gct aag cct ctt gcg tgg atg gta gca act 1107
His Ile Leu Pro Gly Val Ala Lys Pro Leu Ala Trp Met Val Ala Thr

355 360 365 

gag ctt gtg gcc ggt ttg ttg ttg gga ttc gtg ttt acg ttg agt cac 1155
Glu Leu Val Ala Gly Leu Leu Leu Gly Phe Val Phe Thr Leu Ser His
370 375 380

aat gga aag gag gtt tac aat gaa tcg aag gac ttc gtg aga gcc cag 1203
Asn Gly Lys Glu Val Tyr Asn Glu Ser Lys Asp Phe Val Arg Ala Gln
385 390 395 

gtt att acc acc cgt aac acc aag cga ggc tgg ttc aac gat tgg ttc 1251 
Val Ile Thr Thr Arg Asn Thr Lys Arg Gly Trp Phe Asn Asp Trp Phe
400 405 410 

act ggg gga ctc gac acc cag att gag cat cac ctg ttt cca aca atg 1299
Thr Gly Gly Leu Asp Thr Gln Ile Glu His His Leu Phe Pro Thr Met
415 420 425 430

ccc agg cac aac tac ccc aag atc gca cct cag gtc gag gct ctt tgc 1347
Pro Arg His Asn Tyr Pro Lys Ile Ala Pro Gln Val Glu Ala Leu Cys

435 440 445 

aag aag cac ggc ctc gag tac gat aat gtc tcc gtc gtt ggt gcc tct 1395
Lys Lys His Gly Leu Glu Tyr Asp Asn Val Ser Val Val Gly Ala Ser 

450 455 460 

gtc gcg gtt gtg aag gcg ctc aag gaa att gct gat gaa gcg tca att 1443
Val Ala Val Val Lys Ala Leu Lys Glu Ile Ala Asp Glu Ala Ser Ile
465 470 475

cgg ctt cac gct cac taa gtcgac 1467
Arg Leu His Ala His 
480 



<210> 10 
<211> 483 
<212> PRT 

<213> Ceratodon purpureus 
<400> 10 

Met Ala Leu Val Thr Asp Phe Leu Asn Phe Leu Gly Thr Thr Trp Ser
1 5 10 15

Lys Tyr Ser Val Tyr Thr His Ser Tyr Ala Gly Asn Tyr Gly Pro Thr
20 25 30

Leu Lys His Ala Lys Lys Val Ser Ala Gln Gly Lys Thr Ala Gly Gln
35 40 45

Thr Leu Arg Gln Arg Ser Val Gln Asp Lys Lys Pro Gly Thr Tyr Ser
50 55 60

Leu Ala Asp Val Ala Ser His Asp Arg Pro Gly Asp Cys Trp Met Ile
65 70 75 80

Val Lys Glu Lys Val Tyr Asp Ile Ser Arg Phe Ala Asp Asp His Pro
85 90 95

Gly Gly Thr Val Ile Ser Thr Tyr Phe Gly Arg Asp Gly Thr Asp Val
100 105 110

Phe Ala Thr Phe His Pro Pro Ala Ala Trp Lys Gln Leu Asn Asp Tyr
115 120 125

Tyr Ile Gly Asp Leu Ala Arg Glu Glu Pro Leu Asp Glu Leu Leu Lys
130 135 140

Asp Tyr Arg Asp Met Arg Ala Glu Phe Val Arg Glu Gly Leu Phe Lys
145 150 155 160

Ser Ser Lys Ala Trp Phe Leu Leu Gln Thr Leu Ile Asn Ala Ala Leu
165 170 175

Phe Ala Ala Ser Ile Ala Thr Ile Cys Tyr Asp Lys Ser Tyr Trp Ala
180 185 190

Ile Val Leu Ser Ala Ser Leu Met Gly Leu Phe Val Gln Gln Cys Gly
195 200 205

Trp Leu Ala His Asp Phe Leu His Gln Gln Val Phe Glu Asn Arg Thr
210 215 220

Ala Asn Ser Phe Phe Gly Tyr Leu Phe Gly Asn Cys Val Leu Gly Phe
225 230 235 240

Ser Val Ser Trp Trp Arg Thr Lys His Asn Ile His His Thr Ala Pro
245 250 255

Asn Glu Cys Asp Glu Gln Tyr Thr Pro Leu Asp Glu Asp Ile Asp Thr
260 265 270

Leu Pro Ile Ile Ala Trp Ser Lys Glu Ile Leu Ala Thr Val Glu Ser
275 280 285

Lys Arg Ile Leu Arg Val Leu Gln Tyr Gln His Tyr Met Ile Leu Pro
290 295 300

Leu Leu Phe Met Ala Arg Tyr Ser Trp Thr Phe Gly Ser Leu Leu Phe
305 310 315 320

Thr Phe Asn Pro Asp Leu Ser Thr Thr Lys Gly Leu Ile Glu Lys Gly
325 330 335

Thr Val Ala Phe His Tyr Ala Trp Phe Ser Trp Ala Ala Phe His Ile
340 345 350

Leu Pro Gly Val Ala Lys Pro Leu Ala Trp Met Val Ala Thr Glu Leu
355 360 365

Val Ala Gly Leu Leu Leu Gly Phe Val Phe Thr Leu Ser His Asn Gly
370 375 380

Lys Glu Val Tyr Asn Glu Ser Lys Asp Phe Val Arg Ala Gln Val Ile
385 390 395 400

Thr Thr Arg Asn Thr Lys Arg Gly Trp Phe Asn Asp Trp Phe Thr Gly
405 410 415

Gly Leu Asp Thr Gln Ile Glu His His Leu Phe Pro Thr Met Pro Arg
420 425 430

His Asn Tyr Pro Lys Ile Ala Pro Gln Val Glu Ala Leu Cys Lys Lys
435 440 445

His Gly Leu Glu Tyr Asp Asn Val Ser Val Val Gly Ala Ser Val Ala
450 455 460

Val Val Lys Ala Leu Lys Glu Ile Ala Asp Glu Ala Ser Ile Arg Leu
465 470 475 480

His Ala His



<210> 11 
<211> 2160 
<212> DNA 

<213> Ceratodon purpureus 
<220> 


<221> CDS 

<222> (159)..(1721)
<223> Δ6-desaturase

<400> 11 

cggaggtctc ttgtcgttct tggagtctgt gtcgagcttg gaatgcggta ggcgcggccg 60

tttcgtggtt ttggcgttgg cattgcgcga gggcggacag tgggagtgcg ggaggtctgt 120 

ttgtgcatga cgaggtggtt gtaatcttcg ccggcaga atg gtg tcc cag ggc ggc 176
Met Val Ser Gln Gly Gly
1 5 

ggt ctc tcg cag ggt tcc att gaa gaa aac att gac gtt gag cac ttg 224
Gly Leu Ser Gln Gly Ser Ile Glu Glu Asn Ile Asp Val Glu His Leu

10 15 20 

gca acg atg ccc ctc gtc agt gac ttc cta aat gtc ctg gga acg act 272
Ala Thr Met Pro Leu Val Ser Asp Phe Leu Asn Val Leu Gly Thr Thr 

25 30 35 

ttg ggc cag tgg agt ctt tcc act aca ttc gct ttc aag agg ctc acg 320
Leu Gly Gln Trp Ser Leu Ser Thr Thr Phe Ala Phe Lys Arg Leu Thr
40 45 50

act aag aaa cac agt tcg gac atc tcg gtg gag gca caa aaa gaa tcg 368
Thr Lys Lys His Ser Ser Asp Ile Ser Val Glu Ala Gln Lys Glu Ser
55 60 65 70 

gtt gcg cgg ggg cca gtt gag aat att tct caa tcg gtt gcg cag ccc 416
Val Ala Arg Gly Pro Val Glu Asn Ile Ser Gln Ser Val Ala Gln Pro

75 80 85 



atc agg cgg agg tgg gtg cag gat aaa aag ccg gtt act tac agc ctg 464
Ile Arg Arg Arg Trp Val Gln Asp Lys Lys Pro Val Thr Tyr Ser Leu

90 95 100 

aag gat gta gct tcg cac gat atg ccc cag gac tgc tgg att ata atc 512
Lys Asp Val Ala Ser His Asp Met Pro Gln Asp Cys Trp Ile Ile Ile
105 110 115 

aaa gag aag gtg tat gat gtg agc acc ttc gct gag cag cac cct gga 560
Lys Glu Lys Val Tyr Asp Val Ser Thr Phe Ala Glu Gln His Pro Gly
120 125 130 



ggc acg gtt atc aac acc tac ttc gga cga gac gcc aca gat gtt ttc 608
Gly Thr Val Ile Asn Thr Tyr Phe Gly Arg Asp Ala Thr Asp Val Phe
135 140 145 150 

tct act ttc cac gca tcc acc tca tgg aag att ctt cag aat ttc tac 656
Ser Thr Phe His Ala Ser Thr Ser Trp Lys Ile Leu Gln Asn Phe Tyr

155 160 165 



atc ggg aac ctt gtt agg gag gag ccg act ttg gag ctg ctg aag gag 704
Ile Gly Asn Leu Val Arg Glu Glu Pro Thr Leu Glu Leu Leu Lys Glu

170 175 180 



tac aga gag ttg aga gcc ctt ttc ttg aga gaa cag ctt ttc aag agt 752
Tyr Arg Glu Leu Arg Ala Leu Phe Leu Arg Glu Gln Leu Phe Lys Ser
185 190 195 

tcc aaa tcc tac tac ctt ttc aag act ctc ata aat gtt tcc att gtt 800
Ser Lys Ser Tyr Tyr Leu Phe Lys Thr Leu Ile Asn Val Ser Ile Val
200 205 210

gcc aca agc att gcg ata atc agt ctg tac aag tct tac cgg gcg gtt 848
Ala Thr Ser Ile Ala Ile Ile Ser Leu Tyr Lys Ser Tyr Arg Ala Val
215 220 225 230

ctg tta tca gcc agt ttg atg ggc ttg ttt att caa cag tgc gga tgg 896
Leu Leu Ser Ala Ser Leu Met Gly Leu Phe Ile Gln Gln Cys Gly Trp
235 240 245

ttg tct cac gat ttt cta cac cat cag gta ttt gag aca cgc tgg ctc 944
Leu Ser His Asp Phe Leu His His Gln Val Phe Glu Thr Arg Trp Leu
250 255 260

aat gac gtt gtt ggc tat gtg gtc ggc aac gtt gtt ctg gga ttc agt 992
Asn Asp Val Val Gly Tyr Val Val Gly Asn Val Val Leu Gly Phe Ser
265 270 275

gtc tcg tgg tgg aag acc aag cac aac ctg cat cat gct gct ccg aat 1040
Val Ser Trp Trp Lys Thr Lys His Asn Leu His His Ala Ala Pro Asn
280 285 290

gaa tgc gac caa aag tac aca ccg att gat gag gat att gat act ctc 1088
Glu Cys Asp Gln Lys Tyr Thr Pro Ile Asp Glu Asp Ile Asp Thr Leu
295 300 305 310

ccc atc att gct tgg agt aaa gat ctc ttg gcc act gtt gag agc aag 1136
Pro Ile Ile Ala Trp Ser Lys Asp Leu Leu Ala Thr Val Glu Ser Lys
315 320 325

acc atg ttg cga gtt ctt cag tac cag cac cta ttc ttt ttg gtt ctt 1184
Thr Met Leu Arg Val Leu Gln Tyr Gln His Leu Phe Phe Leu Val Leu
330 335 340

ttg acg ttt gcc cgg gcg agt tgg cta ttt tgg agc gcg gcc ttc act 1232
Leu Thr Phe Ala Arg Ala Ser Trp Leu Phe Trp Ser Ala Ala Phe Thr
345 350 355

ctc agg ccc gag ttg acc ctt ggc gag aag ctt ttg gag agg gga acg 1280
Leu Arg Pro Glu Leu Thr Leu Gly Glu Lys Leu Leu Glu Arg Gly Thr
360 365 370

atg gct ttg cac tac att tgg ttt aat agt gtt gcg ttt tat ctg ctc 1328
Met Ala Leu His Tyr Ile Trp Phe Asn Ser Val Ala Phe Tyr Leu Leu
375 380 385 390 

ccc gga tgg aaa cca gtt gta tgg atg gtg gtc agc gag ctc atg tct 1376
Pro Gly Trp Lys Pro Val Val Trp Met Val Val Ser Glu Leu Met Ser 

395 400 405 

ggt ttc ctg ctg gga tac gta ttt gta ctc agt cac aat gga atg gag 1424
Gly Phe Leu Leu Gly Tyr Val Phe Val Leu Ser His Asn Gly Met Glu 

410 415 420 

gtg tac aat acg tca aag gac ttc gtg aat gcc cag att gca tcg act 1472
Val Tyr Asn Thr Ser Lys Asp Phe Val Asn Ala Gln Ile Ala Ser Thr
425 430 435 

cgc gac atc aaa gca ggg gtg ttt aat gat tgg ttc acc gga ggt ctc 1520
Arg Asp Ile Lys Ala Gly Val Phe Asn Asp Trp Phe Thr Gly Gly Leu
440 445 450

aac aga cag att gag cat cat cta ttt cca acg atg ccc agg cac aac 1568
Asn Arg Gln Ile Glu His His Leu Phe Pro Thr Met Pro Arg His Asn
455 460 465 470 

ctt aat aaa att tct cct cac gtg gag act ttg tgc aag aag cat gga 1616 
Leu Asn Lys Ile Ser Pro His Val Glu Thr Leu Cys Lys Lys His Gly

475 480 485 

ctg gtc tac gaa gac gtg agc atg gct tcg ggc act tac cgg gtt ttg 1664
Leu Val Tyr Glu Asp Val Ser Met Ala Ser Gly Thr Tyr Arg Val Leu 

490 495 500 

aaa aca ctt aag gac gtt gcc gat gct gct tca cac cag cag ctt gct 1712
Lys Thr Leu Lys Asp Val Ala Asp Ala Ala Ser His Gln Gln Leu Ala
505 510 515 



gcg agt tga ggcatcgcag cactcgtcga aacatttttg tctgttatag 1761
Ala Ser
520

tgttcatatg tgatcgaggg gaaaaggtcc catgctctga tctatfcttc tgtagccaat 1821

atttttcaat tgaaaggagg ttcctcactt atcttccatc tatcgttgca catcctgcat 1881 

cagagttagc gttggagtaa tgttaagcac ttgtagatta tgcccaccat tgccacattt 1941 

ctgttcggtt acaatcgttt gattccatgc tatcctccgt gttcatctcg ttgttataag 2001 

caagcttgaa aaaacatgct acgagattgg cagacgttgt cttggcagct gtagaggttg 2061

gttccattca ttgtgtagta cagaactctc tcgtccctgt ttctctacat tacttgttac 2121 

atagtgactt tcattcacag caaaaaaaaa aaaaaaaaa 2160 



<210> 12 
<211> 520 
<212> PRT 

<213> Ceratodon purpureus 
<400> 12 

Met Val Ser Gln Gly Gly Gly Leu Ser Gln Gly Ser Ile Glu Glu Asn
1 5 10 15

Ile Asp Val Glu His Leu Ala Thr Met Pro Leu Val Ser Asp Phe Leu
20 25 30

Asn Val Leu Gly Thr Thr Leu Gly Gln Trp Ser Leu Ser Thr Thr Phe
35 40 45

Ala Phe Lys Arg Leu Thr Thr Lys Lys His Ser Ser Asp Ile Ser Val
50 55 60

Glu Ala Gln Lys Glu Ser Val Ala Arg Gly Pro Val Glu Asn Ile Ser
65 70 75 80

Gln Ser Val Ala Gln Pro Ile Arg Arg Arg Trp Val Gln Asp Lys Lys
85 90 95

Pro Val Thr Tyr Ser Leu Lys Asp Val Ala Ser His Asp Met Pro Gln
100 105 110

Asp Cys Trp Ile Ile Ile Lys Glu Lys Val Tyr Asp Val Ser Thr Phe
115 120 125

Ala Glu Gln His Pro Gly Gly Thr Val Ile Asn Thr Tyr Phe Gly Arg
130 135 140

Asp Ala Thr Asp Val Phe Ser Thr Phe His Ala Ser Thr Ser Trp Lys
145 150 155 160

Ile Leu Gln Asn Phe Tyr Ile Gly Asn Leu Val Arg Glu Glu Pro Thr
165 170 175

Leu Glu Leu Leu Lys Glu Tyr Arg Glu Leu Arg Ala Leu Phe Leu Arg
180 185 190

Glu Gln Leu Phe Lys Ser Ser Lys Ser Tyr Tyr Leu Phe Lys Thr Leu
195 200 205

Ile Asn Val Ser Ile Val Ala Thr Ser Ile Ala Ile Ile Ser Leu Tyr
210 215 220

Lys Ser Tyr Arg Ala Val Leu Leu Ser Ala Ser Leu Met Gly Leu Phe
225 230 235 240

Ile Gln Gln Cys Gly Trp Leu Ser His Asp Phe Leu His His Gln Val
245 250 255

Phe Glu Thr Arg Trp Leu Asn Asp Val Val Gly Tyr Val Val Gly Asn
260 265 270

Val Val Leu Gly Phe Ser Val Ser Trp Trp Lys Thr Lys His Asn Leu
275 280 285

His His Ala Ala Pro Asn Glu Cys Asp Gln Lys Tyr Thr Pro Ile Asp
290 295 300

Glu Asp Ile Asp Thr Leu Pro Ile Ile Ala Trp Ser Lys Asp Leu Leu
305 310 315 320

Ala Thr Val Glu Ser Lys Thr Met Leu Arg Val Leu Gln Tyr Gln His
325 330 335

Leu Phe Phe Leu Val Leu Leu Thr Phe Ala Arg Ala Ser Trp Leu Phe
340 345 350

Trp Ser Ala Ala Phe Thr Leu Arg Pro Glu Leu Thr Leu Gly Glu Lys
355 360 365

Leu Leu Glu Arg Gly Thr Met Ala Leu His Tyr Ile Trp Phe Asn Ser
370 375 380

Val Ala Phe Tyr Leu Leu Pro Gly Trp Lys Pro Val Val Trp Met Val
385 390 395 400

Val Ser Glu Leu Met Ser Gly Phe Leu Leu Gly Tyr Val Phe Val Leu
405 410 415

Ser His Asn Gly Met Glu Val Tyr Asn Thr Ser Lys Asp Phe Val Asn
420 425 430

Ala Gln Ile Ala Ser Thr Arg Asp Ile Lys Ala Gly Val Phe Asn Asp
435 440 445

Trp Phe Thr Gly Gly Leu Asn Arg Gln Ile Glu His His Leu Phe Pro
450 455 460

Thr Met Pro Arg His Asn Leu Asn Lys Ile Ser Pro His Val Glu Thr
465 470 475 480

Leu Cys Lys Lys His Gly Leu Val Tyr Glu Asp Val Ser Met Ala Ser
485 490 495

Gly Thr Tyr Arg Val Leu Lys Thr Leu Lys Asp Val Ala Asp Ala Ala
500 505 510

Ser His Gln Gln Leu Ala Ala Ser
515 520



<210> 13
<211> 1434 
<212> DNA 

<213> Phaeodactylum tricornutum 

<220> 

<221> CDS 

<222> (1)..(1434)

<223> Δ6-desaturase

<400> 13 

atg ggc aaa gga ggg gac gct cgg gcc tcg aag ggc tca acg gcg gct 48
Met Gly Lys Gly Gly Asp Ala Arg Ala Ser Lys Gly Ser Thr Ala Ala
1 5 10 15

cgc aag atc agt tgg cag gaa gtc aag acc cac gcg tct ccg gag gac 96
Arg Lys Ile Ser Trp Gln Glu Val Lys Thr His Ala Ser Pro Glu Asp

20 25 30 

gcc tgg atc att cac tcc aat aag gtc tac gac gtg tcc aac tgg cac 144
Ala Trp Ile Ile His Ser Asn Lys Val Tyr Asp Val Ser Asn Trp His

35 40 45 

gaa cat ccc gga ggc gcc gtc att ttc acg cac gcc ggt gac gac atg 192
Glu His Pro Gly Gly Ala Val Ile Phe Thr His Ala Gly Asp Asp Met
50 55 60 

acg gac att ttc gct gcc ttt cac gca ccc gga tcg cag tcg ctc atg 240
Thr Asp Ile Phe Ala Ala Phe His Ala Pro Gly Ser Gln Ser Leu Met
65 70 75 80 

aag aag ttc tac att ggc gaa ttg ctc ccg gaa acc acc ggc aag gag 288
Lys Lys Phe Tyr Ile Gly Glu Leu Leu Pro Glu Thr Thr Gly Lys Glu

85 90 95 

ccg cag caa atc gcc ttt gaa aag ggc tac cgc gat ctg cgc tcc aaa 336
Pro Gln Gln Ile Ala Phe Glu Lys Gly Tyr Arg Asp Leu Arg Ser Lys
100 105 110






ctc atc atg atg ggc atg ttc aag tcc aac aag tgg ttc tac gtc tac 384
Leu Ile Met Met Gly Met Phe Lys Ser Asn Lys Trp Phe Tyr Val Tyr
115 120 125

aag tgc ctc agc aac atg gcc att tgg gcc gcc gcc tgt gct ctc gtc 432
Lys Cys Leu Ser Asn Met Ala Ile Trp Ala Ala Ala Cys Ala Leu Val
130 135 140

ttt tac tcg gac cgc ttc tgg gta cac ctg gcc agc gcc gtc atg ctg 480
Phe Tyr Ser Asp Arg Phe Trp Val His Leu Ala Ser Ala Val Met Leu
145 150 155 160

gga aca ttc ttt cag cag tcg gga tgg ttg gca cac gac ttt ctg cac 528
Gly Thr Phe Phe Gln Gln Ser Gly Trp Leu Ala His Asp Phe Leu His
165 170 175

cac cag gtc ttc acc aag cgc aag cac ggg gat ctc gga gga ctc ttt 576
His Gln Val Phe Thr Lys Arg Lys His Gly Asp Leu Gly Gly Leu Phe
180 185 190

tgg ggg aac ctc atg cag ggt tac tcc gta cag tgg tgg aaa aac aag 624
Trp Gly Asn Leu Met Gln Gly Tyr Ser Val Gln Trp Trp Lys Asn Lys
195 200 205

cac aac gga cac cac gcc gtc ccc aac ctc cac tgc tcc tcc gca gtc 672
His Asn Gly His His Ala Val Pro Asn Leu His Cys Ser Ser Ala Val
210 215 220

gcg caa gat ggg gac ccg gac atc gat acc atg ccc ctt ctc gcc tgg 720
Ala Gln Asp Gly Asp Pro Asp Ile Asp Thr Met Pro Leu Leu Ala Trp
225 230 235 240

tcc gtc cag caa gcc cag tct tac cgg gaa ctc caa gcc gac gga aag 768
Ser Val Gln Gln Ala Gln Ser Tyr Arg Glu Leu Gln Ala Asp Gly Lys
245 250 255

gat tcg ggt ttg gtc aag ttc atg atc cgt aac caa tcc tac ttt tac 816
Asp Ser Gly Leu Val Lys Phe Met Ile Arg Asn Gln Ser Tyr Phe Tyr
260 265 270

ttt ccc atc ttg ttg ctc gcc cgc ctg tcg tgg ttg aac gag tcc ttc 864
Phe Pro Ile Leu Leu Leu Ala Arg Leu Ser Trp Leu Asn Glu Ser Phe
275 280 285 

aag tgc gcc ttt ggg ctt gga gct gcg tcg gag aac gct gct ctc gaa 912
Lys Cys Ala Phe Gly Leu Gly Ala Ala Ser Glu Asn Ala Ala Leu Glu
290 295 300 

etc aag gec aag ggt ctt cag tac ccc ctt ttg gaa aag get ggc ate 960 
Leu Lys Ala Lys Gly Leu Gin Tyr Pro Leu Leu Glu Lys Ala Gly He 
305 310 315 320 

ctg ctg cac tac get tgg atg ctt aca gtt teg tec ggc ttt gga cgc 1008 

Leu Leu His Tyr Ala Trp Met Leu Thr Val Ser Ser Gly Phe Gly Arg 

325 330 335 


ttc tcg ttc gcg tac acc gca ttt tac ttt cta acc gcg acc gcg tcc 1056
Phe Ser Phe Ala Tyr Thr Ala Phe Tyr Phe Leu Thr Ala Thr Ala Ser
340 345 350

tgt gga ttc ttg ctc gcc att gtc ttt ggc ctc ggc cac aac ggc atg 1104
Cys Gly Phe Leu Leu Ala Ile Val Phe Gly Leu Gly His Asn Gly Met
355 360 365

gcc acc tac aat gcc gac gcc cgt ccg gac ttc tgg aag ctc caa gtc 1152
Ala Thr Tyr Asn Ala Asp Ala Arg Pro Asp Phe Trp Lys Leu Gln Val
370 375 380

acc acg act cgc aac gtc acg ggc gga cac ggt ttc ccc caa gcc ttt 1200
Thr Thr Thr Arg Asn Val Thr Gly Gly His Gly Phe Pro Gln Ala Phe
385 390 395 400

gtc gac tgg ttc tgt ggt ggc ctc cag tac caa gtc gac cac cac tta 1248
Val Asp Trp Phe Cys Gly Gly Leu Gln Tyr Gln Val Asp His His Leu
405 410 415

ttc ccc agc ctg ccc cga cac aat ctg gcc aag aca cac gca ctg gtc 1296
Phe Pro Ser Leu Pro Arg His Asn Leu Ala Lys Thr His Ala Leu Val
420 425 430

gaa tcg ttc tgc aag gag tgg ggt gtc cag tac cac gaa gcc gac ctt 1344
Glu Ser Phe Cys Lys Glu Trp Gly Val Gln Tyr His Glu Ala Asp Leu
435 440 445

gtg gac ggg acc atg gaa gtc ttg cac cat ttg ggc agc gtg gcc ggc 1392
Val Asp Gly Thr Met Glu Val Leu His His Leu Gly Ser Val Ala Gly
450 455 460

gaa ttc gtc gtg gat ttt gta cgc gat gga ccc gcc atg taa 1434
Glu Phe Val Val Asp Phe Val Arg Asp Gly Pro Ala Met
465 470 475



<210> 14 
<211> 477 
<212> PRT 

<213> Phaeodactylum tricornutum 
<400> 14 

Met Gly Lys Gly Gly Asp Ala Arg Ala Ser Lys Gly Ser Thr Ala Ala
1 5 10 15

Arg Lys Ile Ser Trp Gln Glu Val Lys Thr His Ala Ser Pro Glu Asp
20 25 30

Ala Trp Ile Ile His Ser Asn Lys Val Tyr Asp Val Ser Asn Trp His
35 40 45

Glu His Pro Gly Gly Ala Val Ile Phe Thr His Ala Gly Asp Asp Met
50 55 60

Thr Asp Ile Phe Ala Ala Phe His Ala Pro Gly Ser Gln Ser Leu Met
65 70 75 80

Lys Lys Phe Tyr Ile Gly Glu Leu Leu Pro Glu Thr Thr Gly Lys Glu
85 90 95

Pro Gln Gln Ile Ala Phe Glu Lys Gly Tyr Arg Asp Leu Arg Ser Lys
100 105 110
Leu Ile Met Met Gly Met Phe Lys Ser Asn Lys Trp Phe Tyr Val Tyr
115 120 125

Lys Cys Leu Ser Asn Met Ala Ile Trp Ala Ala Ala Cys Ala Leu Val
130 135 140

Phe Tyr Ser Asp Arg Phe Trp Val His Leu Ala Ser Ala Val Met Leu
145 150 155 160

Gly Thr Phe Phe Gln Gln Ser Gly Trp Leu Ala His Asp Phe Leu His
165 170 175

His Gln Val Phe Thr Lys Arg Lys His Gly Asp Leu Gly Gly Leu Phe
180 185 190

Trp Gly Asn Leu Met Gln Gly Tyr Ser Val Gln Trp Trp Lys Asn Lys
195 200 205

His Asn Gly His His Ala Val Pro Asn Leu His Cys Ser Ser Ala Val
210 215 220

Ala Gln Asp Gly Asp Pro Asp Ile Asp Thr Met Pro Leu Leu Ala Trp
225 230 235 240

Ser Val Gln Gln Ala Gln Ser Tyr Arg Glu Leu Gln Ala Asp Gly Lys
245 250 255

Asp Ser Gly Leu Val Lys Phe Met Ile Arg Asn Gln Ser Tyr Phe Tyr
260 265 270

Phe Pro Ile Leu Leu Leu Ala Arg Leu Ser Trp Leu Asn Glu Ser Phe
275 280 285

Lys Cys Ala Phe Gly Leu Gly Ala Ala Ser Glu Asn Ala Ala Leu Glu
290 295 300

Leu Lys Ala Lys Gly Leu Gln Tyr Pro Leu Leu Glu Lys Ala Gly Ile
305 310 315 320

Leu Leu His Tyr Ala Trp Met Leu Thr Val Ser Ser Gly Phe Gly Arg
325 330 335



Phe Ser Phe Ala Tyr Thr Ala Phe Tyr Phe Leu Thr Ala Thr Ala Ser
340 345 350

Cys Gly Phe Leu Leu Ala Ile Val Phe Gly Leu Gly His Asn Gly Met
355 360 365

Ala Thr Tyr Asn Ala Asp Ala Arg Pro Asp Phe Trp Lys Leu Gln Val
370 375 380

Thr Thr Thr Arg Asn Val Thr Gly Gly His Gly Phe Pro Gln Ala Phe
385 390 395 400

Val Asp Trp Phe Cys Gly Gly Leu Gln Tyr Gln Val Asp His His Leu
405 410 415

Phe Pro Ser Leu Pro Arg His Asn Leu Ala Lys Thr His Ala Leu Val
420 425 430

Glu Ser Phe Cys Lys Glu Trp Gly Val Gln Tyr His Glu Ala Asp Leu
435 440 445

Val Asp Gly Thr Met Glu Val Leu His His Leu Gly Ser Val Ala Gly
450 455 460

Glu Phe Val Val Asp Phe Val Arg Asp Gly Pro Ala Met
465 470 475



<210> 15 
<211> 1563 
<212> DNA 

<213> Ceratodon purpureus 

<220> 

<221> CDS 

<222> (1)..(1563) 

<223> Δ6-desaturase

<400> 15
atg gtg tcc cag ggc ggc ggt ctc tcg cag ggt tcc att gaa gaa aac 48
Met Val Ser Gln Gly Gly Gly Leu Ser Gln Gly Ser Ile Glu Glu Asn
1 5 10 15

att gac gtt gag cac ttg gca acg atg ccc ctc gtc agt gac ttc cta 96
Ile Asp Val Glu His Leu Ala Thr Met Pro Leu Val Ser Asp Phe Leu
20 25 30

aat gtc ctg gga acg act ttg ggc cag tgg agt ctt tcc act aca ttc 144
Asn Val Leu Gly Thr Thr Leu Gly Gln Trp Ser Leu Ser Thr Thr Phe
35 40 45

gct ttc aag agg ctc acg act aag aaa cac agt tcg gac atc tcg gtg 192
Ala Phe Lys Arg Leu Thr Thr Lys Lys His Ser Ser Asp Ile Ser Val
50 55 60

gag gca caa aaa gaa tcg gtt gcg cgg ggg cca gtt gag aat att tct 240
Glu Ala Gln Lys Glu Ser Val Ala Arg Gly Pro Val Glu Asn Ile Ser
65 70 75 80

caa tcg gtt gcg cag ccc atc agg cgg agg tgg gtg cag gat aaa aag 288
Gln Ser Val Ala Gln Pro Ile Arg Arg Arg Trp Val Gln Asp Lys Lys
85 90 95

ccg gtt act tac agc ctg aag gat gta gct tcg cac gat atg ccc cag 336
Pro Val Thr Tyr Ser Leu Lys Asp Val Ala Ser His Asp Met Pro Gln
100 105 110

gac tgc tgg att ata atc aaa gag aag gtg tat gat gtg agc acc ttc 384
Asp Cys Trp Ile Ile Ile Lys Glu Lys Val Tyr Asp Val Ser Thr Phe
115 120 125

gct gag cag cac cct gga ggc acg gtt atc aac acc tac ttc gga cga 432
Ala Glu Gln His Pro Gly Gly Thr Val Ile Asn Thr Tyr Phe Gly Arg
130 135 140

gac gcc aca gat gtt ttc tct act ttc cac gca tcc acc tca tgg aag 480
Asp Ala Thr Asp Val Phe Ser Thr Phe His Ala Ser Thr Ser Trp Lys
145 150 155 160
att ctt cag aat ttc tac atc ggg aac ctt gtt agg gag gag ccg act 528
Ile Leu Gln Asn Phe Tyr Ile Gly Asn Leu Val Arg Glu Glu Pro Thr
165 170 175

ttg gag ctg ctg aag gag tac aga gag ttg aga gcc ctt ttc ttg aga 576
Leu Glu Leu Leu Lys Glu Tyr Arg Glu Leu Arg Ala Leu Phe Leu Arg
180 185 190

gaa cag ctt ttc aag agt tcc aaa tcc tac tac ctt ttc aag act ctc 624
Glu Gln Leu Phe Lys Ser Ser Lys Ser Tyr Tyr Leu Phe Lys Thr Leu
195 200 205

ata aat gtt tcc att gtt gcc aca agc att gcg ata atc agt ctg tac 672
Ile Asn Val Ser Ile Val Ala Thr Ser Ile Ala Ile Ile Ser Leu Tyr
210 215 220



aag tct tac cgg gcg gtt ctg tta tca gcc agt ttg atg ggc ttg ttt 720
Lys Ser Tyr Arg Ala Val Leu Leu Ser Ala Ser Leu Met Gly Leu Phe
225 230 235 240

att caa cag tgc gga tgg ttg tct cac gat ttt cta cac cat cag gta 768
Ile Gln Gln Cys Gly Trp Leu Ser His Asp Phe Leu His His Gln Val
245 250 255

ttt gag aca cgc tgg ctc aat gac gtt gtt ggc tat gtg gtc ggc aac 816
Phe Glu Thr Arg Trp Leu Asn Asp Val Val Gly Tyr Val Val Gly Asn
260 265 270

gtt gtt ctg gga ttc agt gtc tcg tgg tgg aag acc aag cac aac ctg 864
Val Val Leu Gly Phe Ser Val Ser Trp Trp Lys Thr Lys His Asn Leu
275 280 285

cat cat gct gct ccg aat gaa tgc gac caa aag tac aca ccg att gat 912
His His Ala Ala Pro Asn Glu Cys Asp Gln Lys Tyr Thr Pro Ile Asp
290 295 300

gag gat att gat act ctc ccc atc att gct tgg agt aaa gat ctc ttg 960
Glu Asp Ile Asp Thr Leu Pro Ile Ile Ala Trp Ser Lys Asp Leu Leu
305 310 315 320
gcc act gtt gag agc aag acc atg ttg cga gtt ctt cag tac cag cac 1008
Ala Thr Val Glu Ser Lys Thr Met Leu Arg Val Leu Gln Tyr Gln His
325 330 335

cta ttc ttt ttg gtt ctt ttg acg ttt gcc cgg gcg agt tgg cta ttt 1056
Leu Phe Phe Leu Val Leu Leu Thr Phe Ala Arg Ala Ser Trp Leu Phe
340 345 350

tgg agc gcg gcc ttc act ctc agg ccc gag ttg acc ctt ggc gag aag 1104
Trp Ser Ala Ala Phe Thr Leu Arg Pro Glu Leu Thr Leu Gly Glu Lys
355 360 365

ctt ttg gag agg gga acg atg gct ttg cac tac att tgg ttt aat agt 1152
Leu Leu Glu Arg Gly Thr Met Ala Leu His Tyr Ile Trp Phe Asn Ser
370 375 380

gtt gcg ttt tat ctg ctc ccc gga tgg aaa cca gtt gta tgg atg gtg 1200
Val Ala Phe Tyr Leu Leu Pro Gly Trp Lys Pro Val Val Trp Met Val
385 390 395 400

gtc agc gag ctc atg tct ggt ttc ctg ctg gga tac gta ttt gta ctc 1248
Val Ser Glu Leu Met Ser Gly Phe Leu Leu Gly Tyr Val Phe Val Leu
405 410 415

agt cac aat gga atg gag gtg tac aat acg tca aag gac ttc gtg aat 1296
Ser His Asn Gly Met Glu Val Tyr Asn Thr Ser Lys Asp Phe Val Asn
420 425 430

gcc cag att gca tcg act cgc gac atc aaa gca ggg gtg ttt aat gat 1344
Ala Gln Ile Ala Ser Thr Arg Asp Ile Lys Ala Gly Val Phe Asn Asp
435 440 445

tgg ttc acc gga ggt ctc aac aga cag att gag cat cat cta ttt cca 1392
Trp Phe Thr Gly Gly Leu Asn Arg Gln Ile Glu His His Leu Phe Pro
450 455 460

acg atg ccc agg cac aac ctt aat aaa att tct cct cac gtg gag act 1440
Thr Met Pro Arg His Asn Leu Asn Lys Ile Ser Pro His Val Glu Thr
465 470 475 480

ttg tgc aag aag cat gga ctg gtc tac gaa gac gtg agc atg gct tcg 1488
Leu Cys Lys Lys His Gly Leu Val Tyr Glu Asp Val Ser Met Ala Ser
485 490 495

ggc act tac cgg gtt ttg aaa aca ctt aag gac gtt gcc gat gct gct 1536
Gly Thr Tyr Arg Val Leu Lys Thr Leu Lys Asp Val Ala Asp Ala Ala
500 505 510

tca cac cag cag ctt gct gcg agt tga 1563
Ser His Gln Gln Leu Ala Ala Ser
515 520



<210> 16 
<211> 520 
<212> PRT 
<213> Ceratodon purpureus

<400> 16

Met Val Ser Gln Gly Gly Gly Leu Ser Gln Gly Ser Ile Glu Glu Asn
1 5 10 15

Ile Asp Val Glu His Leu Ala Thr Met Pro Leu Val Ser Asp Phe Leu
20 25 30

Asn Val Leu Gly Thr Thr Leu Gly Gln Trp Ser Leu Ser Thr Thr Phe
35 40 45

Ala Phe Lys Arg Leu Thr Thr Lys Lys His Ser Ser Asp Ile Ser Val
50 55 60

Glu Ala Gln Lys Glu Ser Val Ala Arg Gly Pro Val Glu Asn Ile Ser
65 70 75 80

Gln Ser Val Ala Gln Pro Ile Arg Arg Arg Trp Val Gln Asp Lys Lys
85 90 95

Pro Val Thr Tyr Ser Leu Lys Asp Val Ala Ser His Asp Met Pro Gln
100 105 110
Asp Cys Trp Ile Ile Ile Lys Glu Lys Val Tyr Asp Val Ser Thr Phe
115 120 125

Ala Glu Gln His Pro Gly Gly Thr Val Ile Asn Thr Tyr Phe Gly Arg
130 135 140

Asp Ala Thr Asp Val Phe Ser Thr Phe His Ala Ser Thr Ser Trp Lys
145 150 155 160

Ile Leu Gln Asn Phe Tyr Ile Gly Asn Leu Val Arg Glu Glu Pro Thr
165 170 175

Leu Glu Leu Leu Lys Glu Tyr Arg Glu Leu Arg Ala Leu Phe Leu Arg
180 185 190

Glu Gln Leu Phe Lys Ser Ser Lys Ser Tyr Tyr Leu Phe Lys Thr Leu
195 200 205

Ile Asn Val Ser Ile Val Ala Thr Ser Ile Ala Ile Ile Ser Leu Tyr
210 215 220

Lys Ser Tyr Arg Ala Val Leu Leu Ser Ala Ser Leu Met Gly Leu Phe
225 230 235 240

Ile Gln Gln Cys Gly Trp Leu Ser His Asp Phe Leu His His Gln Val
245 250 255

Phe Glu Thr Arg Trp Leu Asn Asp Val Val Gly Tyr Val Val Gly Asn
260 265 270

Val Val Leu Gly Phe Ser Val Ser Trp Trp Lys Thr Lys His Asn Leu
275 280 285

His His Ala Ala Pro Asn Glu Cys Asp Gln Lys Tyr Thr Pro Ile Asp
290 295 300

Glu Asp Ile Asp Thr Leu Pro Ile Ile Ala Trp Ser Lys Asp Leu Leu
305 310 315 320
Ala Thr Val Glu Ser Lys Thr Met Leu Arg Val Leu Gln Tyr Gln His
325 330 335

Leu Phe Phe Leu Val Leu Leu Thr Phe Ala Arg Ala Ser Trp Leu Phe
340 345 350

Trp Ser Ala Ala Phe Thr Leu Arg Pro Glu Leu Thr Leu Gly Glu Lys
355 360 365

Leu Leu Glu Arg Gly Thr Met Ala Leu His Tyr Ile Trp Phe Asn Ser
370 375 380

Val Ala Phe Tyr Leu Leu Pro Gly Trp Lys Pro Val Val Trp Met Val
385 390 395 400

Val Ser Glu Leu Met Ser Gly Phe Leu Leu Gly Tyr Val Phe Val Leu
405 410 415

Ser His Asn Gly Met Glu Val Tyr Asn Thr Ser Lys Asp Phe Val Asn
420 425 430

Ala Gln Ile Ala Ser Thr Arg Asp Ile Lys Ala Gly Val Phe Asn Asp
435 440 445

Trp Phe Thr Gly Gly Leu Asn Arg Gln Ile Glu His His Leu Phe Pro
450 455 460

Thr Met Pro Arg His Asn Leu Asn Lys Ile Ser Pro His Val Glu Thr
465 470 475 480

Leu Cys Lys Lys His Gly Leu Val Tyr Glu Asp Val Ser Met Ala Ser
485 490 495

Gly Thr Tyr Arg Val Leu Lys Thr Leu Lys Asp Val Ala Asp Ala Ala
500 505 510

Ser His Gln Gln Leu Ala Ala Ser
515 520
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<210> 17 
<211> 1578 
<212> DNA 

<213> Physcomitrella patens 

<220> 

<221> CDS 

<222> (1)..(1578)

<223> Δ6-desaturase

<400> 17 

atg gta ttc gcg ggc ggt gga ctt cag cag ggc tct ctc gaa gaa aac 48
Met Val Phe Ala Gly Gly Gly Leu Gln Gln Gly Ser Leu Glu Glu Asn
1 5 10 15

atc gac gtc gag cac att gcc agt atg tct ctc ttc agc gac ttc ttc 96
Ile Asp Val Glu His Ile Ala Ser Met Ser Leu Phe Ser Asp Phe Phe
20 25 30



agt tat gtg tct tca act gtt ggt tcg tgg agc gta cac agt ata caa 144
Ser Tyr Val Ser Ser Thr Val Gly Ser Trp Ser Val His Ser Ile Gln
35 40 45

cct ttg aag cgc ctg acg agt aag aag cgt gtt tcg gaa agc gct gcc 192
Pro Leu Lys Arg Leu Thr Ser Lys Lys Arg Val Ser Glu Ser Ala Ala
50 55 60

gtg caa tgt ata tca gct gaa gtt cag aga aat tcg agt acc cag gga 240
Val Gln Cys Ile Ser Ala Glu Val Gln Arg Asn Ser Ser Thr Gln Gly
65 70 75 80

act gcg gag gca ctc gca gaa tca gtc gtg aag ccc acg aga cga agg 288
Thr Ala Glu Ala Leu Ala Glu Ser Val Val Lys Pro Thr Arg Arg Arg
85 90 95

tca tct cag tgg aag aag tcg aca cac ccc cta tca gaa gta gca gta 336
Ser Ser Gln Trp Lys Lys Ser Thr His Pro Leu Ser Glu Val Ala Val
100 105 110

cac aac aag cca agc gat tgc tgg att gtt gta aaa aac aag gtg tat 384
His Asn Lys Pro Ser Asp Cys Trp Ile Val Val Lys Asn Lys Val Tyr
115 120 125

gat gtt tcc aat ttt gcg gac gag cat ccc gga gga tca gtt att agt 432
Asp Val Ser Asn Phe Ala Asp Glu His Pro Gly Gly Ser Val Ile Ser
130 135 140

act tat ttt gga cga gac ggc aca gat gtt ttc tct agt ttt cat gca 480
Thr Tyr Phe Gly Arg Asp Gly Thr Asp Val Phe Ser Ser Phe His Ala
145 150 155 160

gct tct aca tgg aaa att ctt caa gac ttt tac att ggt gac gtg gag 528
Ala Ser Thr Trp Lys Ile Leu Gln Asp Phe Tyr Ile Gly Asp Val Glu
165 170 175

agg gtg gag ccg act cca gag ctg ctg aaa gat ttc cga gaa atg aga 576
Arg Val Glu Pro Thr Pro Glu Leu Leu Lys Asp Phe Arg Glu Met Arg
180 185 190

gct ctt ttc ctg agg gag caa ctt ttc aaa agt tcg aaa ttg tac tat 624
Ala Leu Phe Leu Arg Glu Gln Leu Phe Lys Ser Ser Lys Leu Tyr Tyr
195 200 205

gtt atg aag ctg ctc acg aat gtt gct att ttt gct gcg agc att gca 672
Val Met Lys Leu Leu Thr Asn Val Ala Ile Phe Ala Ala Ser Ile Ala
210 215 220

ata ata tgt tgg agc aag act att tca gcg gtt ttg gct tca gct tgt 720
Ile Ile Cys Trp Ser Lys Thr Ile Ser Ala Val Leu Ala Ser Ala Cys
225 230 235 240

atg atg gct ctg tgt ttc caa cag tgc gga tgg cta tcc cat gat ttt 768
Met Met Ala Leu Cys Phe Gln Gln Cys Gly Trp Leu Ser His Asp Phe
245 250 255

ctc cac aat cag gtg ttt gag aca cgc tgg ctt aat gaa gtt gtc ggg 816
Leu His Asn Gln Val Phe Glu Thr Arg Trp Leu Asn Glu Val Val Gly
260 265 270

tat gtg atc ggc aac gcc gtt ctg ggg ttt agt aca ggg tgg tgg aag 864
Tyr Val Ile Gly Asn Ala Val Leu Gly Phe Ser Thr Gly Trp Trp Lys
275 280 285

gag aag cat aac ctt cat cat gct gct cca aat gaa tgc gat cag act 912
Glu Lys His Asn Leu His His Ala Ala Pro Asn Glu Cys Asp Gln Thr
290 295 300

tac caa cca att gat gaa gat att gat act ctc ccc ctc att gcc tgg 960
Tyr Gln Pro Ile Asp Glu Asp Ile Asp Thr Leu Pro Leu Ile Ala Trp
305 310 315 320

agc aag gac ata ctg gcc aca gtt gag aat aag aca ttc ttg cga atc 1008
Ser Lys Asp Ile Leu Ala Thr Val Glu Asn Lys Thr Phe Leu Arg Ile
325 330 335

ctc caa tac cag cat ctg ttc ttc atg ggt ctg tta ttt ttc gcc cgt 1056
Leu Gln Tyr Gln His Leu Phe Phe Met Gly Leu Leu Phe Phe Ala Arg
340 345 350

ggt agt tgg ctc ttt tgg agc tgg aga tat acc tct aca gca gtg ctc 1104
Gly Ser Trp Leu Phe Trp Ser Trp Arg Tyr Thr Ser Thr Ala Val Leu
355 360 365

tca cct gtc gac agg ttg ttg gag aag gga act gtt ctg ttt cac tac 1152
Ser Pro Val Asp Arg Leu Leu Glu Lys Gly Thr Val Leu Phe His Tyr
370 375 380

ttt tgg ttc gtc ggg aca gcg tgc tat ctt ctc cct ggt tgg aag cca 1200
Phe Trp Phe Val Gly Thr Ala Cys Tyr Leu Leu Pro Gly Trp Lys Pro
385 390 395 400

tta gta tgg atg gcg gtg act gag ctc atg tcc ggc atg ctg ctg ggc 1248
Leu Val Trp Met Ala Val Thr Glu Leu Met Ser Gly Met Leu Leu Gly
405 410 415

ttt gta ttt gta ctt agc cac aat ggg atg gag gtt tat aat tcg tct 1296
Phe Val Phe Val Leu Ser His Asn Gly Met Glu Val Tyr Asn Ser Ser
420 425 430

aaa gaa ttc gtg agt gca cag atc gta tcc aca cgg gat atc aaa gga 1344
Lys Glu Phe Val Ser Ala Gln Ile Val Ser Thr Arg Asp Ile Lys Gly
435 440 445

aac ata ttc aac gac tgg ttc act ggt ggc ctt aac agg caa ata gag 1392
Asn Ile Phe Asn Asp Trp Phe Thr Gly Gly Leu Asn Arg Gln Ile Glu
450 455 460

cat cat ctt ttc cca aca atg ccc agg cat aat tta aac aaa ata gca 1440
His His Leu Phe Pro Thr Met Pro Arg His Asn Leu Asn Lys Ile Ala
465 470 475 480

cct aga gtg gag gtg ttc tgt aag aaa cac ggt ctg gtg tac gaa gac 1488
Pro Arg Val Glu Val Phe Cys Lys Lys His Gly Leu Val Tyr Glu Asp
485 490 495

gta tct att gct acc ggc act tgc aag gtt ttg aaa gca ttg aag gaa 1536
Val Ser Ile Ala Thr Gly Thr Cys Lys Val Leu Lys Ala Leu Lys Glu
500 505 510

gtc gcg gag gct gcg gca gag cag cat gct acc acc agt taa 1578
Val Ala Glu Ala Ala Ala Glu Gln His Ala Thr Thr Ser
515 520 525


<210> 18 
<211> 525 
<212> PRT 

<213> Physcomitrella patens 
<400> 18 

Met Val Phe Ala Gly Gly Gly Leu Gln Gln Gly Ser Leu Glu Glu Asn
1 5 10 15

Ile Asp Val Glu His Ile Ala Ser Met Ser Leu Phe Ser Asp Phe Phe
20 25 30

Ser Tyr Val Ser Ser Thr Val Gly Ser Trp Ser Val His Ser Ile Gln
35 40 45

Pro Leu Lys Arg Leu Thr Ser Lys Lys Arg Val Ser Glu Ser Ala Ala
50 55 60
Val Gln Cys Ile Ser Ala Glu Val Gln Arg Asn Ser Ser Thr Gln Gly
65 70 75 80

Thr Ala Glu Ala Leu Ala Glu Ser Val Val Lys Pro Thr Arg Arg Arg
85 90 95

Ser Ser Gln Trp Lys Lys Ser Thr His Pro Leu Ser Glu Val Ala Val
100 105 110

His Asn Lys Pro Ser Asp Cys Trp Ile Val Val Lys Asn Lys Val Tyr
115 120 125

Asp Val Ser Asn Phe Ala Asp Glu His Pro Gly Gly Ser Val Ile Ser
130 135 140

Thr Tyr Phe Gly Arg Asp Gly Thr Asp Val Phe Ser Ser Phe His Ala
145 150 155 160

Ala Ser Thr Trp Lys Ile Leu Gln Asp Phe Tyr Ile Gly Asp Val Glu
165 170 175

Arg Val Glu Pro Thr Pro Glu Leu Leu Lys Asp Phe Arg Glu Met Arg
180 185 190

Ala Leu Phe Leu Arg Glu Gln Leu Phe Lys Ser Ser Lys Leu Tyr Tyr
195 200 205

Val Met Lys Leu Leu Thr Asn Val Ala Ile Phe Ala Ala Ser Ile Ala
210 215 220

Ile Ile Cys Trp Ser Lys Thr Ile Ser Ala Val Leu Ala Ser Ala Cys
225 230 235 240

Met Met Ala Leu Cys Phe Gln Gln Cys Gly Trp Leu Ser His Asp Phe
245 250 255

Leu His Asn Gln Val Phe Glu Thr Arg Trp Leu Asn Glu Val Val Gly
260 265 270
Tyr Val Ile Gly Asn Ala Val Leu Gly Phe Ser Thr Gly Trp Trp Lys
275 280 285

Glu Lys His Asn Leu His His Ala Ala Pro Asn Glu Cys Asp Gln Thr
290 295 300

Tyr Gln Pro Ile Asp Glu Asp Ile Asp Thr Leu Pro Leu Ile Ala Trp
305 310 315 320

Ser Lys Asp Ile Leu Ala Thr Val Glu Asn Lys Thr Phe Leu Arg Ile
325 330 335

Leu Gln Tyr Gln His Leu Phe Phe Met Gly Leu Leu Phe Phe Ala Arg
340 345 350

Gly Ser Trp Leu Phe Trp Ser Trp Arg Tyr Thr Ser Thr Ala Val Leu
355 360 365

Ser Pro Val Asp Arg Leu Leu Glu Lys Gly Thr Val Leu Phe His Tyr
370 375 380

Phe Trp Phe Val Gly Thr Ala Cys Tyr Leu Leu Pro Gly Trp Lys Pro
385 390 395 400

Leu Val Trp Met Ala Val Thr Glu Leu Met Ser Gly Met Leu Leu Gly
405 410 415

Phe Val Phe Val Leu Ser His Asn Gly Met Glu Val Tyr Asn Ser Ser
420 425 430

Lys Glu Phe Val Ser Ala Gln Ile Val Ser Thr Arg Asp Ile Lys Gly
435 440 445

Asn Ile Phe Asn Asp Trp Phe Thr Gly Gly Leu Asn Arg Gln Ile Glu
450 455 460

His His Leu Phe Pro Thr Met Pro Arg His Asn Leu Asn Lys Ile Ala
465 470 475 480
Pro Arg Val Glu Val Phe Cys Lys Lys His Gly Leu Val Tyr Glu Asp
485 490 495

Val Ser Ile Ala Thr Gly Thr Cys Lys Val Leu Lys Ala Leu Lys Glu
500 505 510

Val Ala Glu Ala Ala Ala Glu Gln His Ala Thr Thr Ser
515 520 525



<210> 19 
<211> 837 
<212> DNA 

<213> Phytophthora infestans 

<220> 

<221> CDS 

<222> (1)..(837)

<223> Δ6-elongase



<400> 19 

atg tcg act gag cta ctg cag agc tac tac gcg tgg gcc aac gcc acg 48
Met Ser Thr Glu Leu Leu Gln Ser Tyr Tyr Ala Trp Ala Asn Ala Thr
1 5 10 15

gag gcc aag ctg ctg gac tgg gtc gac cct gag ggc ggc tgg aag gtg 96
Glu Ala Lys Leu Leu Asp Trp Val Asp Pro Glu Gly Gly Trp Lys Val
20 25 30

cat cct atg gca gac tac ccc cta gcc aac ttc tcc agc gtc tac gcc 144
His Pro Met Ala Asp Tyr Pro Leu Ala Asn Phe Ser Ser Val Tyr Ala
35 40 45

atc tgc gtc gga tac ttg ctc ttc gta atc ttc ggc acg gcc ctg atg 192
Ile Cys Val Gly Tyr Leu Leu Phe Val Ile Phe Gly Thr Ala Leu Met
50 55 60

aaa atg gga gtc ccc gcc atc aag acc agt cca tta cag ttt gtg tac 240
Lys Met Gly Val Pro Ala Ile Lys Thr Ser Pro Leu Gln Phe Val Tyr
65 70 75 80

aac ccc atc caa gtc att gcc tgc tct tat atg tgc gtg gag gcc gcc 288
Asn Pro Ile Gln Val Ile Ala Cys Ser Tyr Met Cys Val Glu Ala Ala
85 90 95

atc cag gcc tac cgc aac ggc tac acc gcc gcc ccg tgc aac gcc ttt 336
Ile Gln Ala Tyr Arg Asn Gly Tyr Thr Ala Ala Pro Cys Asn Ala Phe
100 105 110

aag tcc gac gac ccc gtc atg ggc aac gtt ctg tac ctc ttc tat ctc 384
Lys Ser Asp Asp Pro Val Met Gly Asn Val Leu Tyr Leu Phe Tyr Leu
115 120 125

tcc aag atg ctc gac ctg tgc gac aca gtc ttc att atc cta gga aag 432
Ser Lys Met Leu Asp Leu Cys Asp Thr Val Phe Ile Ile Leu Gly Lys
130 135 140

aag tgg aaa cag ctt tcc atc ttg cac gtg tac cac cac ctt acc gtg 480
Lys Trp Lys Gln Leu Ser Ile Leu His Val Tyr His His Leu Thr Val
145 150 155 160

ctt ttc gtc tac tat gtg acg ttc cgc gcc gct cag gac ggg gac tca 528
Leu Phe Val Tyr Tyr Val Thr Phe Arg Ala Ala Gln Asp Gly Asp Ser
165 170 175

tat gct acc atc gtg ctc aac ggc ttc gtg cac acc atc atg tac act 576
Tyr Ala Thr Ile Val Leu Asn Gly Phe Val His Thr Ile Met Tyr Thr
180 185 190

tac tac ttc gtc agc gcc cac acg cgc aac att tgg tgg aag aag tac 624
Tyr Tyr Phe Val Ser Ala His Thr Arg Asn Ile Trp Trp Lys Lys Tyr
195 200 205

ctc acg cgc att cag ctt atc cag ttc gtg acc atg aac gtg cag ggc 672
Leu Thr Arg Ile Gln Leu Ile Gln Phe Val Thr Met Asn Val Gln Gly
210 215 220

tac ctg acc tac tct cga cag tgc cca ggc atg cct cct aag gtg ccg 720
Tyr Leu Thr Tyr Ser Arg Gln Cys Pro Gly Met Pro Pro Lys Val Pro
225 230 235 240
ctc atg tac ctt gtg tac gtg cag tca ctc ttc tgg ctc ttc atg aat 768
Leu Met Tyr Leu Val Tyr Val Gln Ser Leu Phe Trp Leu Phe Met Asn
245 250 255

ttc tac att cgc gcg tac gtg ttc ggc ccc aag aaa ccg gcc gtg gag 816
Phe Tyr Ile Arg Ala Tyr Val Phe Gly Pro Lys Lys Pro Ala Val Glu
260 265 270

gaa tcg aag aag aag ttg taa 837
Glu Ser Lys Lys Lys Leu
275



<210> 20 
<211> 278 
<212> PRT 

<213> Phytophthora infestans 
<400> 20 

Met Ser Thr Glu Leu Leu Gln Ser Tyr Tyr Ala Trp Ala Asn Ala Thr
1 5 10 15

Glu Ala Lys Leu Leu Asp Trp Val Asp Pro Glu Gly Gly Trp Lys Val
20 25 30

His Pro Met Ala Asp Tyr Pro Leu Ala Asn Phe Ser Ser Val Tyr Ala
35 40 45

Ile Cys Val Gly Tyr Leu Leu Phe Val Ile Phe Gly Thr Ala Leu Met
50 55 60

Lys Met Gly Val Pro Ala Ile Lys Thr Ser Pro Leu Gln Phe Val Tyr
65 70 75 80

Asn Pro Ile Gln Val Ile Ala Cys Ser Tyr Met Cys Val Glu Ala Ala
85 90 95

Ile Gln Ala Tyr Arg Asn Gly Tyr Thr Ala Ala Pro Cys Asn Ala Phe
100 105 110

Lys Ser Asp Asp Pro Val Met Gly Asn Val Leu Tyr Leu Phe Tyr Leu
115 120 125

Ser Lys Met Leu Asp Leu Cys Asp Thr Val Phe Ile Ile Leu Gly Lys
130 135 140

Lys Trp Lys Gln Leu Ser Ile Leu His Val Tyr His His Leu Thr Val
145 150 155 160

Leu Phe Val Tyr Tyr Val Thr Phe Arg Ala Ala Gln Asp Gly Asp Ser
165 170 175

Tyr Ala Thr Ile Val Leu Asn Gly Phe Val His Thr Ile Met Tyr Thr
180 185 190

Tyr Tyr Phe Val Ser Ala His Thr Arg Asn Ile Trp Trp Lys Lys Tyr
195 200 205

Leu Thr Arg Ile Gln Leu Ile Gln Phe Val Thr Met Asn Val Gln Gly
210 215 220

Tyr Leu Thr Tyr Ser Arg Gln Cys Pro Gly Met Pro Pro Lys Val Pro
225 230 235 240

Leu Met Tyr Leu Val Tyr Val Gln Ser Leu Phe Trp Leu Phe Met Asn
245 250 255

Phe Tyr Ile Arg Ala Tyr Val Phe Gly Pro Lys Lys Pro Ala Val Glu
260 265 270

Glu Ser Lys Lys Lys Leu
275 
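
The `<211>` lengths given for the coding sequences and for their translations in this listing can be checked against each other: when the CDS feature spans the whole entry and ends in a stop codon, the nucleotide count is three per residue plus three for the stop. The following is a minimal sketch of that arithmetic using the lengths stated for SEQ ID NO: 15/16, 17/18 and 19/20. The names `cds_length`, `PAIRS` and `check_pairs` are illustrative assumptions and are not part of the listing.

```python
def cds_length(protein_length: int) -> int:
    """Nucleotides in a CDS encoding `protein_length` residues plus one stop codon."""
    return 3 * (protein_length + 1)


# (DNA SEQ ID NO, <211> of the DNA entry, protein SEQ ID NO, <211> of the protein entry)
# Values copied from the <211> lines of the listing; the tuple layout is an assumption.
PAIRS = [
    (15, 1563, 16, 520),
    (17, 1578, 18, 525),
    (19, 837, 20, 278),
]


def check_pairs(pairs):
    """Return the DNA SEQ ID NOs whose stated length disagrees with the protein length."""
    return [
        dna_id
        for dna_id, dna_len, _prot_id, prot_len in pairs
        if cds_length(prot_len) != dna_len
    ]


if __name__ == "__main__":
    # An empty list means every stated pair is consistent.
    print(check_pairs(PAIRS))
```

An empty result indicates that each stated DNA length equals three times the protein length plus one stop codon.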



<210> 21 
<211> 1410 
<212> DNA 

<213> Phaeodactylum tricornutum 
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<220> 

<221> CDS 

<222> (1)..(1410) 

<223> Δ5-desaturase



<400> 21 

atg gct ccg gat gcg gat aag ctt cga caa cgc cag acg act gcg gta 48
Met Ala Pro Asp Ala Asp Lys Leu Arg Gln Arg Gln Thr Thr Ala Val
1 5 10 15

gcg aag cac aat gct gct acc ata tcg acg cag gaa cgc ctt tgc agt 96
Ala Lys His Asn Ala Ala Thr Ile Ser Thr Gln Glu Arg Leu Cys Ser
20 25 30

ctg tct tcg ctc aaa ggc gaa gaa gtc tgc atc gac gga atc atc tat 144
Leu Ser Ser Leu Lys Gly Glu Glu Val Cys Ile Asp Gly Ile Ile Tyr
35 40 45

gac ctc caa tca ttc gat cat ccc ggg ggt gaa acg atc aaa atg ttt 192
Asp Leu Gln Ser Phe Asp His Pro Gly Gly Glu Thr Ile Lys Met Phe
50 55 60

ggt ggc aac gat gtc act gta cag tac aag atg att cac ccg tac cat 240
Gly Gly Asn Asp Val Thr Val Gln Tyr Lys Met Ile His Pro Tyr His
65 70 75 80

acc gag aag cat ttg gaa aag atg aag cgt gtc ggc aag gtg acg gat 288
Thr Glu Lys His Leu Glu Lys Met Lys Arg Val Gly Lys Val Thr Asp
85 90 95

ttc gtc tgc gag tac aag ttc gat acc gaa ttt gaa cgc gaa atc aaa 336
Phe Val Cys Glu Tyr Lys Phe Asp Thr Glu Phe Glu Arg Glu Ile Lys
100 105 110

cga gaa gtc ttc aag att gtg cga cga ggc aag gat ttc ggt act ttg 384
Arg Glu Val Phe Lys Ile Val Arg Arg Gly Lys Asp Phe Gly Thr Leu
115 120 125

gga tgg ttc ttc cgt gcg ttt tgc tac att gcc att ttc ttc tac ctg 432
Gly Trp Phe Phe Arg Ala Phe Cys Tyr Ile Ala Ile Phe Phe Tyr Leu
130 135 140

cag tac cat tgg gtc acc acg gga acc tct tgg ctg ctg gcc gtg gcc 480
Gln Tyr His Trp Val Thr Thr Gly Thr Ser Trp Leu Leu Ala Val Ala
145 150 155 160

tac gga atc tcc caa gcg atg att ggc atg aat gtc cag cac gat gcc 528
Tyr Gly Ile Ser Gln Ala Met Ile Gly Met Asn Val Gln His Asp Ala
165 170 175

aac cac ggg gcc acc tcc aag cgt ccc tgg gtc aac gac atg cta ggc 576
Asn His Gly Ala Thr Ser Lys Arg Pro Trp Val Asn Asp Met Leu Gly
180 185 190

ctc ggt gcg gat ttt att ggt ggt tcc aag tgg ctc tgg cag gaa caa 624
Leu Gly Ala Asp Phe Ile Gly Gly Ser Lys Trp Leu Trp Gln Glu Gln
195 200 205

cac tgg acc cac cac gct tac acc aat cac gcc gag atg gat ccc gat 672
His Trp Thr His His Ala Tyr Thr Asn His Ala Glu Met Asp Pro Asp
210 215 220

agc ttt ggt gcc gaa cca atg ctc cta ttc aac gac tat ccc ttg gat 720
Ser Phe Gly Ala Glu Pro Met Leu Leu Phe Asn Asp Tyr Pro Leu Asp
225 230 235 240

cat ccc gct cgt acc tgg cta cat cgc ttt caa gca ttc ttt tac atg 768
His Pro Ala Arg Thr Trp Leu His Arg Phe Gln Ala Phe Phe Tyr Met
245 250 255

ccc gtc ttg gct gga tac tgg ttg tcc gct gtc ttc aat cca caa att 816
Pro Val Leu Ala Gly Tyr Trp Leu Ser Ala Val Phe Asn Pro Gln Ile
260 265 270

ctt gac ctc cag caa cgc ggc gca ctt tcc gtc ggt atc cgt ctc gac 864
Leu Asp Leu Gln Gln Arg Gly Ala Leu Ser Val Gly Ile Arg Leu Asp
275 280 285

aac gct ttc att cac tcg cga cgc aag tat gcg gtt ttc tgg cgg gct 912
Asn Ala Phe Ile His Ser Arg Arg Lys Tyr Ala Val Phe Trp Arg Ala
290 295 300
gtg tac att gcg gtg aac gtg att gct ccg ttt tac aca aac tcc ggc 960
Val Tyr Ile Ala Val Asn Val Ile Ala Pro Phe Tyr Thr Asn Ser Gly
305 310 315 320

ctc gaa tgg tcc tgg cgt gtc ttt gga aac atc atg ctc atg ggt gtg 1008
Leu Glu Trp Ser Trp Arg Val Phe Gly Asn Ile Met Leu Met Gly Val
325 330 335

gcg gaa tcg ctc gcg ctg gcg gtc ctg ttt tcg ttg tcg cac aat ttc 1056
Ala Glu Ser Leu Ala Leu Ala Val Leu Phe Ser Leu Ser His Asn Phe
340 345 350

gaa tcc gcg gat cgc gat ccg acc gcc cca ctg aaa aag acg gga gaa 1104
Glu Ser Ala Asp Arg Asp Pro Thr Ala Pro Leu Lys Lys Thr Gly Glu
355 360 365

cca gtc gac tgg ttc aag aca cag gtc gaa act tcc tgc act tac ggt 1152
Pro Val Asp Trp Phe Lys Thr Gln Val Glu Thr Ser Cys Thr Tyr Gly
370 375 380

gga ttc ctt tcc ggt tgc ttc acg gga ggt ctc aac ttt cag gtt gaa 1200
Gly Phe Leu Ser Gly Cys Phe Thr Gly Gly Leu Asn Phe Gln Val Glu
385 390 395 400

cac cac ttg ttc cca cgc atg agc agc gct tgg tat ccc tac att gcc 1248
His His Leu Phe Pro Arg Met Ser Ser Ala Trp Tyr Pro Tyr Ile Ala
405 410 415

ccc aag gtc cgc gaa att tgc gcc aaa cac ggc gtc cac tac gcc tac 1296
Pro Lys Val Arg Glu Ile Cys Ala Lys His Gly Val His Tyr Ala Tyr
420 425 430

tac ccg tgg atc cac caa aac ttt ctc tcc acc gtc cgc tac atg cac 1344
Tyr Pro Trp Ile His Gln Asn Phe Leu Ser Thr Val Arg Tyr Met His
435 440 445

gcg gcc ggg acc ggt gcc aac tgg cgc cag atg gcc aga gaa aat ccc 1392
Ala Ala Gly Thr Gly Ala Asn Trp Arg Gln Met Ala Arg Glu Asn Pro
450 455 460

ttg acc gga cgg gcg taa 1410
Leu Thr Gly Arg Ala
465 470 
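
Each codon row of a CDS entry can be checked against the amino acid row printed beneath it by translating the triplets with the standard genetic code. The sketch below does this for the final row of SEQ ID NO: 21 (`ttg acc gga cgg gcg taa`). The helper name `translate_row` and both lookup tables are illustrative assumptions; the standard code is used on the assumption that it applies to this entry.

```python
from itertools import product

BASES = "tcag"
# Standard genetic code with codons enumerated in t, c, a, g order; '*' marks a stop.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# One-letter to three-letter codes, matching the notation of the listing.
THREE_LETTER = {
    "A": "Ala", "R": "Arg", "N": "Asn", "D": "Asp", "C": "Cys", "Q": "Gln",
    "E": "Glu", "G": "Gly", "H": "His", "I": "Ile", "L": "Leu", "K": "Lys",
    "M": "Met", "F": "Phe", "P": "Pro", "S": "Ser", "T": "Thr", "W": "Trp",
    "Y": "Tyr", "V": "Val",
}


def translate_row(codon_line):
    """Translate space-separated codons into three-letter codes, stopping at a stop codon."""
    residues = []
    for codon in codon_line.lower().split():
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        residues.append(THREE_LETTER[aa])
    return residues


if __name__ == "__main__":
    # Last row of SEQ ID NO: 21, without the trailing position count.
    print(" ".join(translate_row("ttg acc gga cgg gcg taa")))
```

The translated residues can then be compared token by token with the amino acid line of the same row.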



<210> 22 
<211> 469 
<212> PRT 

<213> Phaeodactylum tricornutum 
<400> 22

Met Ala Pro Asp Ala Asp Lys Leu Arg Gln Arg Gln Thr Thr Ala Val
1 5 10 15

Ala Lys His Asn Ala Ala Thr Ile Ser Thr Gln Glu Arg Leu Cys Ser
20 25 30

Leu Ser Ser Leu Lys Gly Glu Glu Val Cys Ile Asp Gly Ile Ile Tyr
35 40 45

Asp Leu Gln Ser Phe Asp His Pro Gly Gly Glu Thr Ile Lys Met Phe
50 55 60

Gly Gly Asn Asp Val Thr Val Gln Tyr Lys Met Ile His Pro Tyr His
65 70 75 80

Thr Glu Lys His Leu Glu Lys Met Lys Arg Val Gly Lys Val Thr Asp
85 90 95

Phe Val Cys Glu Tyr Lys Phe Asp Thr Glu Phe Glu Arg Glu Ile Lys
100 105 110

Arg Glu Val Phe Lys Ile Val Arg Arg Gly Lys Asp Phe Gly Thr Leu
115 120 125

Gly Trp Phe Phe Arg Ala Phe Cys Tyr Ile Ala Ile Phe Phe Tyr Leu
130 135 140
Gln Tyr His Trp Val Thr Thr Gly Thr Ser Trp Leu Leu Ala Val Ala
145 150 155 160

Tyr Gly Ile Ser Gln Ala Met Ile Gly Met Asn Val Gln His Asp Ala
165 170 175

Asn His Gly Ala Thr Ser Lys Arg Pro Trp Val Asn Asp Met Leu Gly
180 185 190

Leu Gly Ala Asp Phe Ile Gly Gly Ser Lys Trp Leu Trp Gln Glu Gln
195 200 205

His Trp Thr His His Ala Tyr Thr Asn His Ala Glu Met Asp Pro Asp
210 215 220

Ser Phe Gly Ala Glu Pro Met Leu Leu Phe Asn Asp Tyr Pro Leu Asp
225 230 235 240

His Pro Ala Arg Thr Trp Leu His Arg Phe Gln Ala Phe Phe Tyr Met
245 250 255

Pro Val Leu Ala Gly Tyr Trp Leu Ser Ala Val Phe Asn Pro Gln Ile
260 265 270

Leu Asp Leu Gln Gln Arg Gly Ala Leu Ser Val Gly Ile Arg Leu Asp
275 280 285

Asn Ala Phe Ile His Ser Arg Arg Lys Tyr Ala Val Phe Trp Arg Ala
290 295 300

Val Tyr Ile Ala Val Asn Val Ile Ala Pro Phe Tyr Thr Asn Ser Gly
305 310 315 320

Leu Glu Trp Ser Trp Arg Val Phe Gly Asn Ile Met Leu Met Gly Val
325 330 335

Ala Glu Ser Leu Ala Leu Ala Val Leu Phe Ser Leu Ser His Asn Phe
340 345 350

Glu Ser Ala Asp Arg Asp Pro Thr Ala Pro Leu Lys Lys Thr Gly Glu
355 360 365



Pro Val Asp Trp Phe Lys Thr Gin Val Glu Thr Ser Cys Thr Tyr Gly 
370 375 380 

Gly Phe Leu Ser Gly Cys Phe Thr Gly Gly Leu Asn Phe Gin Val Glu 
385 390 395 400 

His His Leu Phe Pro Arg Met Ser Ser Ala Trp Tyr Pro Tyr lie Ala 

405 410 415 

Pro Lys Val Arg Glu He Cys Ala Lys His Gly Val His Tyr Ala Tyr 

420 425 430 

♦ 

Tyr Pro Trp He His Gin Asn Phe Leu Ser Thr Val Arg Tyr Met His 

435 440 445 

Ala Ala Gly Thr Gly Ala Asn Trp Arg Gin Met Ala Arg Glu Asn Pro 
450 455 460 



Leu Thr Gly Arg Ala 
465 



<210> 23 
<211> 1344 
<212> DNA 

<213> Caenorhabditis elegans 

<220> 

<221> CDS 

<222> (1)..(1344)

<223> Δ5-desaturase

<400> 23 

atg gta tta cga gag caa gag cat gag cca ttc ttc att aaa att gat 48
Met Val Leu Arg Glu Gln Glu His Glu Pro Phe Phe Ile Lys Ile Asp
1 5 10 15

gga aaa tgg tgt caa att gac gat gct gtc ctg aga tca cat cca ggt 96
Gly Lys Trp Cys Gln Ile Asp Asp Ala Val Leu Arg Ser His Pro Gly
20 25 30

ggt agt gca att act acc tat aaa aat atg gat gcc act acc gta ttc 144
Gly Ser Ala Ile Thr Thr Tyr Lys Asn Met Asp Ala Thr Thr Val Phe
35 40 45

cac aca ttc cat act ggt tct aaa gaa gcg tat caa tgg ctg aca gaa 192
His Thr Phe His Thr Gly Ser Lys Glu Ala Tyr Gln Trp Leu Thr Glu
50 55 60

ttg aaa aaa gag tgc cct aca caa gaa cca gag atc cca gat att aag 240
Leu Lys Lys Glu Cys Pro Thr Gln Glu Pro Glu Ile Pro Asp Ile Lys
65 70 75 80

gat gac cca atc aaa gga att gat gat gtg aac atg gga act ttc aat 288
Asp Asp Pro Ile Lys Gly Ile Asp Asp Val Asn Met Gly Thr Phe Asn
85 90 95

att tct gag aaa cga tct gcc caa ata aat aaa agt ttc act gat cta 336
Ile Ser Glu Lys Arg Ser Ala Gln Ile Asn Lys Ser Phe Thr Asp Leu
100 105 110

cgt atg cga gtt cgt gca gaa gga ctt atg gat gga tct cct ttg ttc 384
Arg Met Arg Val Arg Ala Glu Gly Leu Met Asp Gly Ser Pro Leu Phe
115 120 125

tac att aga aaa att ctt gaa aca atc ttc aca att ctt ttt gca ttc 432
Tyr Ile Arg Lys Ile Leu Glu Thr Ile Phe Thr Ile Leu Phe Ala Phe
130 135 140

tac ctt caa tac cac aca tat tat ctt cca tca gct att cta atg gga 480
Tyr Leu Gln Tyr His Thr Tyr Tyr Leu Pro Ser Ala Ile Leu Met Gly
145 150 155 160

gtt gcg tgg caa caa ttg gga tgg tta atc cat gaa ttc gca cat cat 528
Val Ala Trp Gln Gln Leu Gly Trp Leu Ile His Glu Phe Ala His His
165 170 175

cag ttg ttc aaa aac aga tac tac aat gat ttg gcc agc tat ttc gtt 576
Gln Leu Phe Lys Asn Arg Tyr Tyr Asn Asp Leu Ala Ser Tyr Phe Val
180 185 190

gga aac ttt tta caa gga ttc tca tct ggt ggt tgg aaa gag cag cac 624
Gly Asn Phe Leu Gln Gly Phe Ser Ser Gly Gly Trp Lys Glu Gln His
195 200 205

aat gtg cat cac gca gcc aca aat gtt gtt gga cga gac gga gat ctt 672
Asn Val His His Ala Ala Thr Asn Val Val Gly Arg Asp Gly Asp Leu
210 215 220

gat tta gtc cca ttc tat gct aca gtg gca gaa cat ctc aac aat tat 720
Asp Leu Val Pro Phe Tyr Ala Thr Val Ala Glu His Leu Asn Asn Tyr
225 230 235 240

tct cag gat tca tgg gtt atg act cta ttc aga tgg caa cat gtt cat 768
Ser Gln Asp Ser Trp Val Met Thr Leu Phe Arg Trp Gln His Val His
245 250 255

tgg aca ttc atg tta cca ttc ctc cgt ctc tcg tgg ctt ctt cag tca 816
Trp Thr Phe Met Leu Pro Phe Leu Arg Leu Ser Trp Leu Leu Gln Ser
260 265 270

atc att ttt gtt agt cag atg cca act cat tat tat gac tat tac aga 864
Ile Ile Phe Val Ser Gln Met Pro Thr His Tyr Tyr Asp Tyr Tyr Arg
275 280 285

aat act gcg att tat gaa cag gtt ggt ctc tct ttg cac tgg gct tgg 912
Asn Thr Ala Ile Tyr Glu Gln Val Gly Leu Ser Leu His Trp Ala Trp
290 295 300

tca ttg ggt caa ttg tat ttc cta ccc gat tgg tca act aga ata atg 960
Ser Leu Gly Gln Leu Tyr Phe Leu Pro Asp Trp Ser Thr Arg Ile Met
305 310 315 320

ttc ttc ctt gtt tct cat ctt gtt gga ggt ttc ctg ctc tct cat gta 1008
Phe Phe Leu Val Ser His Leu Val Gly Gly Phe Leu Leu Ser His Val
325 330 335

gtt act ttc aat cat tat tca gtg gag aag ttt gca ttg agc tcg aac 1056
Val Thr Phe Asn His Tyr Ser Val Glu Lys Phe Ala Leu Ser Ser Asn
340 345 350

atc atg tca aat tac gct tgt ctt caa atc atg acc aca aga aat atg 1104
Ile Met Ser Asn Tyr Ala Cys Leu Gln Ile Met Thr Thr Arg Asn Met
355 360 365

aga cct gga aga ttc att gac tgg ctt tgg gga ggt ctt aac tat cag 1152
Arg Pro Gly Arg Phe Ile Asp Trp Leu Trp Gly Gly Leu Asn Tyr Gln
370 375 380

att gag cac cat ctt ttc cca acg atg cca cga cac aac ttg aac act 1200
Ile Glu His His Leu Phe Pro Thr Met Pro Arg His Asn Leu Asn Thr
385 390 395 400

gtt atg cca ctt gtt aag gag ttt gca gca gca aat ggt tta cca tac 1248
Val Met Pro Leu Val Lys Glu Phe Ala Ala Ala Asn Gly Leu Pro Tyr
405 410 415

atg gtc gac gat tat ttc aca gga ttc tgg ctt gaa att gag caa ttc 1296
Met Val Asp Asp Tyr Phe Thr Gly Phe Trp Leu Glu Ile Glu Gln Phe
420 425 430

cga aat att gca aat gtt gct gct aaa ttg act aaa aag att gcc tag 1344
Arg Asn Ile Ala Asn Val Ala Ala Lys Leu Thr Lys Lys Ile Ala
435 440 445



<210> 24 
<211> 447 
<212> PRT 

<213> Caenorhabditis elegans 



<400> 24 

Met Val Leu Arg Glu Gln Glu His Glu Pro Phe Phe Ile Lys Ile Asp
1 5 10 15

Gly Lys Trp Cys Gln Ile Asp Asp Ala Val Leu Arg Ser His Pro Gly
20 25 30

Gly Ser Ala Ile Thr Thr Tyr Lys Asn Met Asp Ala Thr Thr Val Phe
35 40 45

His Thr Phe His Thr Gly Ser Lys Glu Ala Tyr Gln Trp Leu Thr Glu
50 55 60

Leu Lys Lys Glu Cys Pro Thr Gln Glu Pro Glu Ile Pro Asp Ile Lys
65 70 75 80

Asp Asp Pro Ile Lys Gly Ile Asp Asp Val Asn Met Gly Thr Phe Asn
85 90 95

Ile Ser Glu Lys Arg Ser Ala Gln Ile Asn Lys Ser Phe Thr Asp Leu
100 105 110

Arg Met Arg Val Arg Ala Glu Gly Leu Met Asp Gly Ser Pro Leu Phe
115 120 125

Tyr Ile Arg Lys Ile Leu Glu Thr Ile Phe Thr Ile Leu Phe Ala Phe
130 135 140

Tyr Leu Gln Tyr His Thr Tyr Tyr Leu Pro Ser Ala Ile Leu Met Gly
145 150 155 160

Val Ala Trp Gln Gln Leu Gly Trp Leu Ile His Glu Phe Ala His His
165 170 175

Gln Leu Phe Lys Asn Arg Tyr Tyr Asn Asp Leu Ala Ser Tyr Phe Val
180 185 190

Gly Asn Phe Leu Gln Gly Phe Ser Ser Gly Gly Trp Lys Glu Gln His
195 200 205

Asn Val His His Ala Ala Thr Asn Val Val Gly Arg Asp Gly Asp Leu
210 215 220

Asp Leu Val Pro Phe Tyr Ala Thr Val Ala Glu His Leu Asn Asn Tyr
225 230 235 240

Ser Gln Asp Ser Trp Val Met Thr Leu Phe Arg Trp Gln His Val His
245 250 255

Trp Thr Phe Met Leu Pro Phe Leu Arg Leu Ser Trp Leu Leu Gln Ser
260 265 270

Ile Ile Phe Val Ser Gln Met Pro Thr His Tyr Tyr Asp Tyr Tyr Arg
275 280 285

Asn Thr Ala Ile Tyr Glu Gln Val Gly Leu Ser Leu His Trp Ala Trp
290 295 300

Ser Leu Gly Gln Leu Tyr Phe Leu Pro Asp Trp Ser Thr Arg Ile Met
305 310 315 320

Phe Phe Leu Val Ser His Leu Val Gly Gly Phe Leu Leu Ser His Val
325 330 335

Val Thr Phe Asn His Tyr Ser Val Glu Lys Phe Ala Leu Ser Ser Asn
340 345 350

Ile Met Ser Asn Tyr Ala Cys Leu Gln Ile Met Thr Thr Arg Asn Met
355 360 365

Arg Pro Gly Arg Phe Ile Asp Trp Leu Trp Gly Gly Leu Asn Tyr Gln
370 375 380

Ile Glu His His Leu Phe Pro Thr Met Pro Arg His Asn Leu Asn Thr
385 390 395 400

Val Met Pro Leu Val Lys Glu Phe Ala Ala Ala Asn Gly Leu Pro Tyr
405 410 415

Met Val Asp Asp Tyr Phe Thr Gly Phe Trp Leu Glu Ile Glu Gln Phe
420 425 430

Arg Asn Ile Ala Asn Val Ala Ala Lys Leu Thr Lys Lys Ile Ala
435 440 445



<210> 25 



<211> 954 
<212> DNA 

<213> Mortierella alpina 

<220> 

<221> CDS 

<222> (1)..(954)

<223> Δ6-elongase

<400> 25 

atg gcc gcc gca atc ttg gac aag gtc aac ttc ggc att gat cag ccc 48
Met Ala Ala Ala Ile Leu Asp Lys Val Asn Phe Gly Ile Asp Gln Pro
1 5 10 15

ttc gga atc aag ctc gac acc tac ttt gct cag gcc tat gaa ctc gtc 96
Phe Gly Ile Lys Leu Asp Thr Tyr Phe Ala Gln Ala Tyr Glu Leu Val
20 25 30

acc gga aag tcc atc gac tcc ttc gtc ttc cag gag ggc gtc acg cct 144
Thr Gly Lys Ser Ile Asp Ser Phe Val Phe Gln Glu Gly Val Thr Pro
35 40 45

ctc tcg acc cag aga gag gtc gcc atg tgg act atc act tac ttc gtc 192
Leu Ser Thr Gln Arg Glu Val Ala Met Trp Thr Ile Thr Tyr Phe Val
50 55 60

gtc atc ttt ggt ggt cgc cag atc atg aag agc cag gac gcc ttc aag 240
Val Ile Phe Gly Gly Arg Gln Ile Met Lys Ser Gln Asp Ala Phe Lys
65 70 75 80

ctc aag ccc ctc ttc atc ctc cac aac ttc ctc ctg acg atc gcg tcc 288
Leu Lys Pro Leu Phe Ile Leu His Asn Phe Leu Leu Thr Ile Ala Ser
85 90 95

gga tcg ctg ttg ctc ctg ttc atc gag aac ctg gtc ccc atc ctc gcc 336
Gly Ser Leu Leu Leu Leu Phe Ile Glu Asn Leu Val Pro Ile Leu Ala
100 105 110

aga aac gga ctt ttc tac gcc atc tgc gac gac ggt gcc tgg acc cag 384
Arg Asn Gly Leu Phe Tyr Ala Ile Cys Asp Asp Gly Ala Trp Thr Gln
115 120 125

cgc ctc gag ctc ctc tac tac ctc aac tac ctg gtc aag tac tgg gag 432
Arg Leu Glu Leu Leu Tyr Tyr Leu Asn Tyr Leu Val Lys Tyr Trp Glu
130 135 140

ttg gcc gac acc gtc ttt ttg gtc ctc aag aag aag cct ctt gag ttc 480
Leu Ala Asp Thr Val Phe Leu Val Leu Lys Lys Lys Pro Leu Glu Phe
145 150 155 160

ctg cac tac ttc cac cac tcg atg acc atg gtt ctc tgc ttt gtc cag 528
Leu His Tyr Phe His His Ser Met Thr Met Val Leu Cys Phe Val Gln
165 170 175

ctt gga gga tac act tca gtg tcc tgg gtc cct att acc ctc aac ttg 576
Leu Gly Gly Tyr Thr Ser Val Ser Trp Val Pro Ile Thr Leu Asn Leu
180 185 190

act gtc cac gtc ttc atg tac tac tac tac atg cgc tcc gct gcc ggt 624
Thr Val His Val Phe Met Tyr Tyr Tyr Tyr Met Arg Ser Ala Ala Gly
195 200 205

gtt cgc atc tgg tgg aag cag tac ttg acc act ctc cag atc gtc cag 672
Val Arg Ile Trp Trp Lys Gln Tyr Leu Thr Thr Leu Gln Ile Val Gln
210 215 220

ttc gtt ctt gac ctc gga ttc atc tac ttc tgc gcc tac acc tac ttc 720
Phe Val Leu Asp Leu Gly Phe Ile Tyr Phe Cys Ala Tyr Thr Tyr Phe
225 230 235 240

gcc ttc acc tac ttc ccc tgg gct ccc aac gtc ggc aag tgc gcc ggt 768
Ala Phe Thr Tyr Phe Pro Trp Ala Pro Asn Val Gly Lys Cys Ala Gly
245 250 255

acc gag ggt gct gct ctc ttt ggc tgc gga ctc ctc tcc agc tat ctc 816
Thr Glu Gly Ala Ala Leu Phe Gly Cys Gly Leu Leu Ser Ser Tyr Leu
260 265 270

ttg ctc ttt atc aac ttc tac cgc att acc tac aat gcc aag gcc aag 864
Leu Leu Phe Ile Asn Phe Tyr Arg Ile Thr Tyr Asn Ala Lys Ala Lys
275 280 285

gca gcc aag gag cgt gga agc aac ttt acc ccc aag act gtc aag tcc 912
Ala Ala Lys Glu Arg Gly Ser Asn Phe Thr Pro Lys Thr Val Lys Ser
290 295 300

ggc gga tcg ccc aag aag ccc tcc aag agc aag cac atc taa 954
Gly Gly Ser Pro Lys Lys Pro Ser Lys Ser Lys His Ile
305 310 315



<210> 26 
<211> 317
<212> PRT 

<213> Mortierella alpina 
<400> 26 

Met Ala Ala Ala Ile Leu Asp Lys Val Asn Phe Gly Ile Asp Gln Pro
1 5 10 15

Phe Gly Ile Lys Leu Asp Thr Tyr Phe Ala Gln Ala Tyr Glu Leu Val
20 25 30

Thr Gly Lys Ser Ile Asp Ser Phe Val Phe Gln Glu Gly Val Thr Pro
35 40 45

Leu Ser Thr Gln Arg Glu Val Ala Met Trp Thr Ile Thr Tyr Phe Val
50 55 60

Val Ile Phe Gly Gly Arg Gln Ile Met Lys Ser Gln Asp Ala Phe Lys
65 70 75 80

Leu Lys Pro Leu Phe Ile Leu His Asn Phe Leu Leu Thr Ile Ala Ser
85 90 95

Gly Ser Leu Leu Leu Leu Phe Ile Glu Asn Leu Val Pro Ile Leu Ala
100 105 110

Arg Asn Gly Leu Phe Tyr Ala Ile Cys Asp Asp Gly Ala Trp Thr Gln
115 120 125

Arg Leu Glu Leu Leu Tyr Tyr Leu Asn Tyr Leu Val Lys Tyr Trp Glu
130 135 140

Leu Ala Asp Thr Val Phe Leu Val Leu Lys Lys Lys Pro Leu Glu Phe
145 150 155 160

Leu His Tyr Phe His His Ser Met Thr Met Val Leu Cys Phe Val Gln
165 170 175

Leu Gly Gly Tyr Thr Ser Val Ser Trp Val Pro Ile Thr Leu Asn Leu
180 185 190

Thr Val His Val Phe Met Tyr Tyr Tyr Tyr Met Arg Ser Ala Ala Gly
195 200 205

Val Arg Ile Trp Trp Lys Gln Tyr Leu Thr Thr Leu Gln Ile Val Gln
210 215 220

Phe Val Leu Asp Leu Gly Phe Ile Tyr Phe Cys Ala Tyr Thr Tyr Phe
225 230 235 240

Ala Phe Thr Tyr Phe Pro Trp Ala Pro Asn Val Gly Lys Cys Ala Gly
245 250 255

Thr Glu Gly Ala Ala Leu Phe Gly Cys Gly Leu Leu Ser Ser Tyr Leu
260 265 270

Leu Leu Phe Ile Asn Phe Tyr Arg Ile Thr Tyr Asn Ala Lys Ala Lys
275 280 285

Ala Ala Lys Glu Arg Gly Ser Asn Phe Thr Pro Lys Thr Val Lys Ser
290 295 300

Gly Gly Ser Pro Lys Lys Pro Ser Lys Ser Lys His Ile
305 310 315



<210> 27
<211> 1320
<212> DNA

<213> Thraustochytrium

<220>

<221> CDS

<222> (1)..(1320)

<223> Δ5-desaturase



<400> 27 

atg ggc aag ggc agc gag ggc cgc agc gcg gcg cgc gag atg acg gcc 48
Met Gly Lys Gly Ser Glu Gly Arg Ser Ala Ala Arg Glu Met Thr Ala
1 5 10 15

gag gcg aac ggc gac aag cgg aaa acg att ctg atc gag ggc gtc ctg 96
Glu Ala Asn Gly Asp Lys Arg Lys Thr Ile Leu Ile Glu Gly Val Leu
20 25 30

tac gac gcg acg aac ttt aag cac ccg ggc ggt tcg atc atc aac ttc 144
Tyr Asp Ala Thr Asn Phe Lys His Pro Gly Gly Ser Ile Ile Asn Phe
35 40 45

ttg acc gag ggc gag gcc ggc gtg gac gcg acg cag gcg tac cgc gag 192
Leu Thr Glu Gly Glu Ala Gly Val Asp Ala Thr Gln Ala Tyr Arg Glu
50 55 60

ttt cat cag cgg tcc ggc aag gcc gac aag tac ctc aag tcg ctg ccg 240
Phe His Gln Arg Ser Gly Lys Ala Asp Lys Tyr Leu Lys Ser Leu Pro
65 70 75 80

aag ctg gat gcg tcc aag gtg gag tcg cgg ttc tcg gcc aaa gag cag 288
Lys Leu Asp Ala Ser Lys Val Glu Ser Arg Phe Ser Ala Lys Glu Gln
85 90 95

gcg cgg cgc gac gcc atg acg cgc gac tac gcg gcc ttt cgc gag gag 336
Ala Arg Arg Asp Ala Met Thr Arg Asp Tyr Ala Ala Phe Arg Glu Glu
100 105 110

ctc gtc gcc gag ggg tac ttt gac ccg tcg atc ccg cac atg att tac 384
Leu Val Ala Glu Gly Tyr Phe Asp Pro Ser Ile Pro His Met Ile Tyr
115 120 125

cgc gtc gtg gag atc gtg gcg ctc ttc gcg ctc tcg ttc tgg ctc atg 432
Arg Val Val Glu Ile Val Ala Leu Phe Ala Leu Ser Phe Trp Leu Met
130 135 140

tcc aag gcc tcg ccc acc tcg ctc gtg ctg ggc gtg gtg atg aac ggc 480
Ser Lys Ala Ser Pro Thr Ser Leu Val Leu Gly Val Val Met Asn Gly
145 150 155 160

att gcg cag ggc cgc tgc ggc tgg gtc atg cac gag atg ggc cac ggg 528
Ile Ala Gln Gly Arg Cys Gly Trp Val Met His Glu Met Gly His Gly
165 170 175

tcg ttc acg ggc gtc atc tgg ctc gac gac cgg atg tgc gag ttc ttc 576
Ser Phe Thr Gly Val Ile Trp Leu Asp Asp Arg Met Cys Glu Phe Phe
180 185 190

tac ggc gtc ggc tgc ggc atg agc ggg cac tac tgg aag aac cag cac 624
Tyr Gly Val Gly Cys Gly Met Ser Gly His Tyr Trp Lys Asn Gln His
195 200 205

agc aag cac cac gcc gcg ccc aac cgc ctc gag cac gat gtc gat ctc 672
Ser Lys His His Ala Ala Pro Asn Arg Leu Glu His Asp Val Asp Leu
210 215 220

aac acg ctg ccc ctg gtc gcc ttt aac gag cgc gtc gtg cgc aag gtc 720
Asn Thr Leu Pro Leu Val Ala Phe Asn Glu Arg Val Val Arg Lys Val
225 230 235 240

aag ccg gga tcg ctg ctg gcg ctc tgg ctg cgc gtg cag gcg tac ctc 768
Lys Pro Gly Ser Leu Leu Ala Leu Trp Leu Arg Val Gln Ala Tyr Leu
245 250 255

ttt gcg ccc gtc tcg tgc ctg ctc atc ggc ctt ggc tgg acg ctc tac 816
Phe Ala Pro Val Ser Cys Leu Leu Ile Gly Leu Gly Trp Thr Leu Tyr
260 265 270

ctg cac ccg cgc tac atg ctg cgc acc aag cgg cac atg gag ttc gtc 864
Leu His Pro Arg Tyr Met Leu Arg Thr Lys Arg His Met Glu Phe Val
275 280 285

tgg atc ttc gcg cgc tac att ggc tgg ttc tcg ctc atg ggc gct ctc 912
Trp Ile Phe Ala Arg Tyr Ile Gly Trp Phe Ser Leu Met Gly Ala Leu
290 295 300

ggc tac tcg ccg ggc acc tcg gtc ggg atg tac ctg tgc tcg ttc ggc 960
Gly Tyr Ser Pro Gly Thr Ser Val Gly Met Tyr Leu Cys Ser Phe Gly
305 310 315 320

ctc ggc tgc att tac att ttc ctg cag ttc gcc gtc agc cac acg cac 1008
Leu Gly Cys Ile Tyr Ile Phe Leu Gln Phe Ala Val Ser His Thr His
325 330 335

ctg ccg gtg acc aac ccg gag gac cag ctg cac tgg ctc gag tac gcg 1056
Leu Pro Val Thr Asn Pro Glu Asp Gln Leu His Trp Leu Glu Tyr Ala
340 345 350

gcc gac cac acg gtg aac att agc acc aag tcc tgg ctc gtc acg tgg 1104
Ala Asp His Thr Val Asn Ile Ser Thr Lys Ser Trp Leu Val Thr Trp
355 360 365

tgg atg tcg aac ctg aac ttt cag atc gag cac cac ctc ttc ccc acg 1152
Trp Met Ser Asn Leu Asn Phe Gln Ile Glu His His Leu Phe Pro Thr
370 375 380

gcg ccg cag ttc cgc ttc aag gaa atc agt cct cgc gtc gag gcc ctc 1200
Ala Pro Gln Phe Arg Phe Lys Glu Ile Ser Pro Arg Val Glu Ala Leu
385 390 395 400

ttc aag cgc cac aac ctc ccg tac tac gac ctg ccc tac acg agc gcg 1248
Phe Lys Arg His Asn Leu Pro Tyr Tyr Asp Leu Pro Tyr Thr Ser Ala
405 410 415

gtc tcg acc acc ttt gcc aat ctt tat tcc gtc ggc cac tcg gtc ggc 1296
Val Ser Thr Thr Phe Ala Asn Leu Tyr Ser Val Gly His Ser Val Gly
420 425 430

gcc gac acc aag aag cag gac tga 1320
Ala Asp Thr Lys Lys Gln Asp
435 440

<210> 28
<211> 439
<212> PRT

<213> Thraustochytrium
<400> 28

Met Gly Lys Gly Ser Glu Gly Arg Ser Ala Ala Arg Glu Met Thr Ala
1 5 10 15

Glu Ala Asn Gly Asp Lys Arg Lys Thr Ile Leu Ile Glu Gly Val Leu
20 25 30

Tyr Asp Ala Thr Asn Phe Lys His Pro Gly Gly Ser Ile Ile Asn Phe
35 40 45

Leu Thr Glu Gly Glu Ala Gly Val Asp Ala Thr Gln Ala Tyr Arg Glu
50 55 60

Phe His Gln Arg Ser Gly Lys Ala Asp Lys Tyr Leu Lys Ser Leu Pro
65 70 75 80

Lys Leu Asp Ala Ser Lys Val Glu Ser Arg Phe Ser Ala Lys Glu Gln
85 90 95

Ala Arg Arg Asp Ala Met Thr Arg Asp Tyr Ala Ala Phe Arg Glu Glu
100 105 110

Leu Val Ala Glu Gly Tyr Phe Asp Pro Ser Ile Pro His Met Ile Tyr
115 120 125

Arg Val Val Glu Ile Val Ala Leu Phe Ala Leu Ser Phe Trp Leu Met
130 135 140

Ser Lys Ala Ser Pro Thr Ser Leu Val Leu Gly Val Val Met Asn Gly
145 150 155 160

Ile Ala Gln Gly Arg Cys Gly Trp Val Met His Glu Met Gly His Gly
165 170 175

Ser Phe Thr Gly Val Ile Trp Leu Asp Asp Arg Met Cys Glu Phe Phe
180 185 190

Tyr Gly Val Gly Cys Gly Met Ser Gly His Tyr Trp Lys Asn Gln His
195 200 205

Ser Lys His His Ala Ala Pro Asn Arg Leu Glu His Asp Val Asp Leu
210 215 220

Asn Thr Leu Pro Leu Val Ala Phe Asn Glu Arg Val Val Arg Lys Val
225 230 235 240

Lys Pro Gly Ser Leu Leu Ala Leu Trp Leu Arg Val Gln Ala Tyr Leu
245 250 255

Phe Ala Pro Val Ser Cys Leu Leu Ile Gly Leu Gly Trp Thr Leu Tyr
260 265 270

Leu His Pro Arg Tyr Met Leu Arg Thr Lys Arg His Met Glu Phe Val
275 280 285

Trp Ile Phe Ala Arg Tyr Ile Gly Trp Phe Ser Leu Met Gly Ala Leu
290 295 300

Gly Tyr Ser Pro Gly Thr Ser Val Gly Met Tyr Leu Cys Ser Phe Gly
305 310 315 320

Leu Gly Cys Ile Tyr Ile Phe Leu Gln Phe Ala Val Ser His Thr His
325 330 335

Leu Pro Val Thr Asn Pro Glu Asp Gln Leu His Trp Leu Glu Tyr Ala
340 345 350

Ala Asp His Thr Val Asn Ile Ser Thr Lys Ser Trp Leu Val Thr Trp
355 360 365

Trp Met Ser Asn Leu Asn Phe Gln Ile Glu His His Leu Phe Pro Thr
370 375 380

Ala Pro Gln Phe Arg Phe Lys Glu Ile Ser Pro Arg Val Glu Ala Leu
385 390 395 400

Phe Lys Arg His Asn Leu Pro Tyr Tyr Asp Leu Pro Tyr Thr Ser Ala
405 410 415

Val Ser Thr Thr Phe Ala Asn Leu Tyr Ser Val Gly His Ser Val Gly
420 425 430

Ala Asp Thr Lys Lys Gln Asp
435



<210> 29 
<211> 957 
<212> DNA 

<213> Mortierella alpina 

<220> 

<221> CDS 

<222> (1)..(957)

<223> Δ6-elongase



<400> 29 

atg gag tcg att gcg cca ttc ctc cca tca aag atg ccg caa gat ctg 48
Met Glu Ser Ile Ala Pro Phe Leu Pro Ser Lys Met Pro Gln Asp Leu
1 5 10 15

ttt atg gac ctt gcc acc gct atc ggt gtc cgg gcc gcg ccc tat gtc 96
Phe Met Asp Leu Ala Thr Ala Ile Gly Val Arg Ala Ala Pro Tyr Val
20 25 30

gat cct ctc gag gcc gcg ctg gtg gcc cag gcc gag aag tac atc ccc 144
Asp Pro Leu Glu Ala Ala Leu Val Ala Gln Ala Glu Lys Tyr Ile Pro
35 40 45

acg att gtc cat cac acg cgt ggg ttc ctg gtc gcg gtg gag tcg cct 192
Thr Ile Val His His Thr Arg Gly Phe Leu Val Ala Val Glu Ser Pro
50 55 60

ttg gcc cgt gag ctg ccg ttg atg aac ccg ttc cac gtg ctg ttg atc 240
Leu Ala Arg Glu Leu Pro Leu Met Asn Pro Phe His Val Leu Leu Ile
65 70 75 80

gtg ctc gct tat ttg gtc acg gtc ttt gtg ggc atg cag atc atg aag 288
Val Leu Ala Tyr Leu Val Thr Val Phe Val Gly Met Gln Ile Met Lys
85 90 95

aac ttt gag cgg ttc gag gtc aag acg ttt tcg ctc ctg cac aac ttt 336
Asn Phe Glu Arg Phe Glu Val Lys Thr Phe Ser Leu Leu His Asn Phe
100 105 110

tgt ctg gtc tcg atc agc gcc tac atg tgc ggt ggg atc ctg tac gag 384
Cys Leu Val Ser Ile Ser Ala Tyr Met Cys Gly Gly Ile Leu Tyr Glu
115 120 125

gct tat cag gcc aac tat gga ctg ttt gag aac gct gct gat cat acc 432
Ala Tyr Gln Ala Asn Tyr Gly Leu Phe Glu Asn Ala Ala Asp His Thr
130 135 140

ttc aag ggt ctt cct atg gcc aag atg atc tgg ctc ttc tac ttc tcc 480
Phe Lys Gly Leu Pro Met Ala Lys Met Ile Trp Leu Phe Tyr Phe Ser
145 150 155 160

aag atc atg gag ttt gtc gac acc atg atc atg gtc ctc aag aag aac 528
Lys Ile Met Glu Phe Val Asp Thr Met Ile Met Val Leu Lys Lys Asn
165 170 175

aac cgc cag atc tcc ttc ttg cac gtt tac cac cac agc tcc atc ttc 576
Asn Arg Gln Ile Ser Phe Leu His Val Tyr His His Ser Ser Ile Phe
180 185 190

acc atc tgg tgg ttg gtc acc ttt gtt gca ccc aac ggt gaa gcc tac 624
Thr Ile Trp Trp Leu Val Thr Phe Val Ala Pro Asn Gly Glu Ala Tyr
195 200 205

ttc tct gct gcg ttg aac tcg ttc atc cat gtg atc atg tac ggc tac 672
Phe Ser Ala Ala Leu Asn Ser Phe Ile His Val Ile Met Tyr Gly Tyr
210 215 220

tac ttc ttg tcg gcc ttg ggc ttc aag cag gtg tcg ttc atc aag ttc 720
Tyr Phe Leu Ser Ala Leu Gly Phe Lys Gln Val Ser Phe Ile Lys Phe
225 230 235 240

tac atc acg cgc tcg cag atg aca cag ttc tgc atg atg tcg gtc cag 768
Tyr Ile Thr Arg Ser Gln Met Thr Gln Phe Cys Met Met Ser Val Gln
245 250 255

tct tcc tgg gac atg tac gcc atg aag gtc ctt ggc cgc ccc gga tac 816
Ser Ser Trp Asp Met Tyr Ala Met Lys Val Leu Gly Arg Pro Gly Tyr
260 265 270

ccc ttc ttc atc acg gct ctg ctt tgg ttc tac atg tgg acc atg ctc 864
Pro Phe Phe Ile Thr Ala Leu Leu Trp Phe Tyr Met Trp Thr Met Leu
275 280 285

ggt ctc ttc tac aac ttt tac aga aag aac gcc aag ttg gcc aag cag 912
Gly Leu Phe Tyr Asn Phe Tyr Arg Lys Asn Ala Lys Leu Ala Lys Gln
290 295 300

gcc aag gcc gac gct gcc aag gag aag gca agg aag ttg cag taa 957
Ala Lys Ala Asp Ala Ala Lys Glu Lys Ala Arg Lys Leu Gln
305 310 315



<210> 30 
<211> 318 
<212> PRT 

<213> Mortierella alpina 
<400> 30 

Met Glu Ser Ile Ala Pro Phe Leu Pro Ser Lys Met Pro Gln Asp Leu
1 5 10 15

Phe Met Asp Leu Ala Thr Ala Ile Gly Val Arg Ala Ala Pro Tyr Val
20 25 30

Asp Pro Leu Glu Ala Ala Leu Val Ala Gln Ala Glu Lys Tyr Ile Pro
35 40 45

Thr Ile Val His His Thr Arg Gly Phe Leu Val Ala Val Glu Ser Pro
50 55 60

Leu Ala Arg Glu Leu Pro Leu Met Asn Pro Phe His Val Leu Leu Ile
65 70 75 80

Val Leu Ala Tyr Leu Val Thr Val Phe Val Gly Met Gln Ile Met Lys
85 90 95

Asn Phe Glu Arg Phe Glu Val Lys Thr Phe Ser Leu Leu His Asn Phe
100 105 110

Cys Leu Val Ser Ile Ser Ala Tyr Met Cys Gly Gly Ile Leu Tyr Glu
115 120 125

Ala Tyr Gln Ala Asn Tyr Gly Leu Phe Glu Asn Ala Ala Asp His Thr
130 135 140

Phe Lys Gly Leu Pro Met Ala Lys Met Ile Trp Leu Phe Tyr Phe Ser
145 150 155 160

Lys Ile Met Glu Phe Val Asp Thr Met Ile Met Val Leu Lys Lys Asn
165 170 175

Asn Arg Gln Ile Ser Phe Leu His Val Tyr His His Ser Ser Ile Phe
180 185 190

Thr Ile Trp Trp Leu Val Thr Phe Val Ala Pro Asn Gly Glu Ala Tyr
195 200 205

Phe Ser Ala Ala Leu Asn Ser Phe Ile His Val Ile Met Tyr Gly Tyr
210 215 220

Tyr Phe Leu Ser Ala Leu Gly Phe Lys Gln Val Ser Phe Ile Lys Phe
225 230 235 240

Tyr Ile Thr Arg Ser Gln Met Thr Gln Phe Cys Met Met Ser Val Gln
245 250 255

Ser Ser Trp Asp Met Tyr Ala Met Lys Val Leu Gly Arg Pro Gly Tyr
260 265 270

Pro Phe Phe Ile Thr Ala Leu Leu Trp Phe Tyr Met Trp Thr Met Leu
275 280 285

Gly Leu Phe Tyr Asn Phe Tyr Arg Lys Asn Ala Lys Leu Ala Lys Gln
290 295 300

Ala Lys Ala Asp Ala Ala Lys Glu Lys Ala Arg Lys Leu Gln
305 310 315

<210> 31
<211> 1374
<212> DNA

<213> Mortierella alpina

<220>

<221> CDS

<222> (1)..(1374)

<223> Δ6-desaturase

<400> 31 

atg gct gct gct ccc agt gtg agg acg ttt act cgg gcc gag gtt ttg 48
Met Ala Ala Ala Pro Ser Val Arg Thr Phe Thr Arg Ala Glu Val Leu
1 5 10 15

aat gcc gag gct ctg aat gag ggc aag aag gat gcc gag gca ccc ttc 96
Asn Ala Glu Ala Leu Asn Glu Gly Lys Lys Asp Ala Glu Ala Pro Phe
20 25 30

ttg atg atc atc gac aac aag gtg tac gat gtt cgc gag ttc gtc cct 144
Leu Met Ile Ile Asp Asn Lys Val Tyr Asp Val Arg Glu Phe Val Pro
35 40 45

gat cat ccc ggt gga agt gtg att ctc acg cac gtt ggc aag gac ggc 192
Asp His Pro Gly Gly Ser Val Ile Leu Thr His Val Gly Lys Asp Gly
50 55 60

act gac gtc ttt gac act ttt cac ccc gag gct gct tgg gag act ctt 240
Thr Asp Val Phe Asp Thr Phe His Pro Glu Ala Ala Trp Glu Thr Leu
65 70 75 80

gcc aac ttt tac gtt ggt gat att gac gag agc gac cgc gat atc aag 288
Ala Asn Phe Tyr Val Gly Asp Ile Asp Glu Ser Asp Arg Asp Ile Lys
85 90 95

aat gat gac ttt gcg gcc gag gtc cgc aag ctg cgt acc ttg ttc cag 336
Asn Asp Asp Phe Ala Ala Glu Val Arg Lys Leu Arg Thr Leu Phe Gln
100 105 110

tct ctt ggt tac tac gat tct tcc aag gca tac tac gcc ttc aag gtc 384
Ser Leu Gly Tyr Tyr Asp Ser Ser Lys Ala Tyr Tyr Ala Phe Lys Val
115 120 125

tcg ttc aac ctc tgc atc tgg ggt ttg tcg acg gtc att gtg gcc aag 432
Ser Phe Asn Leu Cys Ile Trp Gly Leu Ser Thr Val Ile Val Ala Lys
130 135 140

tgg ggc cag acc tcg acc ctc gcc aac gtg ctc tcg gct gcg ctt ttg 480
Trp Gly Gln Thr Ser Thr Leu Ala Asn Val Leu Ser Ala Ala Leu Leu
145 150 155 160

ggt ctg ttc tgg cag cag tgc gga tgg ttg gct cac gac ttt ttg cat 528
Gly Leu Phe Trp Gln Gln Cys Gly Trp Leu Ala His Asp Phe Leu His
165 170 175

cac cag gtc ttc cag gac cgt ttc tgg ggt gat ctt ttc ggc gcc ttc 576
His Gln Val Phe Gln Asp Arg Phe Trp Gly Asp Leu Phe Gly Ala Phe
180 185 190

ttg gga ggt gtc tgc cag ggc ttc tcg tcc tcg tgg tgg aag gac aag 624
Leu Gly Gly Val Cys Gln Gly Phe Ser Ser Ser Trp Trp Lys Asp Lys
195 200 205

cac aac act cac cac gcc gcc ccc aac gtc cac ggc gag gat ccc gac 672
His Asn Thr His His Ala Ala Pro Asn Val His Gly Glu Asp Pro Asp
210 215 220

att gac acc cac cct ctg ttg acc tgg agt gag cat gcg ttg gag atg 720
Ile Asp Thr His Pro Leu Leu Thr Trp Ser Glu His Ala Leu Glu Met
225 230 235 240

ttc tcg gat gtc cca gat gag gag ctg acc cgc atg tgg tcg cgt ttc 768
Phe Ser Asp Val Pro Asp Glu Glu Leu Thr Arg Met Trp Ser Arg Phe
245 250 255

atg gtc ctg aac cag acc tgg ttt tac ttc ccc att ctc tcg ttt gcc 816
Met Val Leu Asn Gln Thr Trp Phe Tyr Phe Pro Ile Leu Ser Phe Ala
260 265 270

cgt ctc tcc tgg tgc ctc cag tcc att ctc ttt gtg ctg cct aac ggt 864
Arg Leu Ser Trp Cys Leu Gln Ser Ile Leu Phe Val Leu Pro Asn Gly
275 280 285

cag gcc cac aag ccc tcg ggc gcg cgt gtg ccc atc tcg ttg gtc gag 912
Gln Ala His Lys Pro Ser Gly Ala Arg Val Pro Ile Ser Leu Val Glu
290 295 300

cag ctg tcg ctt gcg atg cac tgg acc tgg tac ctc gcc acc atg ttc 960
Gln Leu Ser Leu Ala Met His Trp Thr Trp Tyr Leu Ala Thr Met Phe
305 310 315 320

ctg ttc atc aag gat ccc gtc aac atg ctg gtg tac ttt ttg gtg tcg 1008
Leu Phe Ile Lys Asp Pro Val Asn Met Leu Val Tyr Phe Leu Val Ser
325 330 335

cag gcg gtg tgc gga aac ttg ttg gcg atc gtg ttc tcg ctc aac cac 1056
Gln Ala Val Cys Gly Asn Leu Leu Ala Ile Val Phe Ser Leu Asn His
340 345 350

aac ggt atg cct gtg atc tcg aag gag gag gcg gtc gat atg gat ttc 1104
Asn Gly Met Pro Val Ile Ser Lys Glu Glu Ala Val Asp Met Asp Phe
355 360 365

ttc acg aag cag atc atc acg ggt cgt gat gtc cac ccg ggt cta ttt 1152
Phe Thr Lys Gln Ile Ile Thr Gly Arg Asp Val His Pro Gly Leu Phe
370 375 380

gcc aac tgg ttc acg ggt gga ttg aac tat cag atc gag cac cac ttg 1200
Ala Asn Trp Phe Thr Gly Gly Leu Asn Tyr Gln Ile Glu His His Leu
385 390 395 400

ttc cct tcg atg cct cgc cac aac ttt tca aag atc cag cct gct gtc 1248
Phe Pro Ser Met Pro Arg His Asn Phe Ser Lys Ile Gln Pro Ala Val
405 410 415

gag acc ctg tgc aaa aag tac aat gtc cga tac cac acc acc ggt atg 1296
Glu Thr Leu Cys Lys Lys Tyr Asn Val Arg Tyr His Thr Thr Gly Met
420 425 430

atc gag gga act gca gag gtc ttt agc cgt ctg aac gag gtc tcc aag 1344
Ile Glu Gly Thr Ala Glu Val Phe Ser Arg Leu Asn Glu Val Ser Lys
435 440 445

gct gcc tcc aag atg ggt aag gcg cag taa 1374
Ala Ala Ser Lys Met Gly Lys Ala Gln
450 455



<210> 32 
<211> 457 
<212> PRT 

<213> Mortierella alpina 
<400> 32 

Met Ala Ala Ala Pro Ser Val Arg Thr Phe Thr Arg Ala Glu Val Leu 
15 10 15 

Asn Ala Glu Ala Leu Asn Glu Gly Lys Lys Asp Ala Glu Ala Pro Phe 

20 25 30 

Leu Met He He Asp Asn Lys Val Tyr Asp Val Arg Glu Phe Val Pro 

35 40 45 

Asp His Pro Gly Gly Ser Val He Leu Thr His Val Gly Lys Asp Gly 
50 55 60 

Thr Asp Val Phe Asp Thr Phe His Pro Glu Ala Ala Trp Glu Thr Leu 
65 70 75 80

Ala Asn Phe Tyr Val Gly Asp Ile Asp Glu Ser Asp Arg Asp Ile Lys
85 90 95

Asn Asp Asp Phe Ala Ala Glu Val Arg Lys Leu Arg Thr Leu Phe Gln
100 105 110

Ser Leu Gly Tyr Tyr Asp Ser Ser Lys Ala Tyr Tyr Ala Phe Lys Val
115 120 125

Ser Phe Asn Leu Cys Ile Trp Gly Leu Ser Thr Val Ile Val Ala Lys
130 135 140

Trp Gly Gln Thr Ser Thr Leu Ala Asn Val Leu Ser Ala Ala Leu Leu
145 150 155 160

Gly Leu Phe Trp Gln Gln Cys Gly Trp Leu Ala His Asp Phe Leu His
165 170 175

His Gln Val Phe Gln Asp Arg Phe Trp Gly Asp Leu Phe Gly Ala Phe
180 185 190

Leu Gly Gly Val Cys Gln Gly Phe Ser Ser Ser Trp Trp Lys Asp Lys
195 200 205

His Asn Thr His His Ala Ala Pro Asn Val His Gly Glu Asp Pro Asp
210 215 220

Ile Asp Thr His Pro Leu Leu Thr Trp Ser Glu His Ala Leu Glu Met
225 230 235 240

Phe Ser Asp Val Pro Asp Glu Glu Leu Thr Arg Met Trp Ser Arg Phe
245 250 255

Met Val Leu Asn Gln Thr Trp Phe Tyr Phe Pro Ile Leu Ser Phe Ala
260 265 270

Arg Leu Ser Trp Cys Leu Gln Ser Ile Leu Phe Val Leu Pro Asn Gly
275 280 285




Gln Ala His Lys Pro Ser Gly Ala Arg Val Pro Ile Ser Leu Val Glu
290 295 300

Gln Leu Ser Leu Ala Met His Trp Thr Trp Tyr Leu Ala Thr Met Phe
305 310 315 320

Leu Phe Ile Lys Asp Pro Val Asn Met Leu Val Tyr Phe Leu Val Ser
325 330 335

Gln Ala Val Cys Gly Asn Leu Leu Ala Ile Val Phe Ser Leu Asn His
340 345 350

Asn Gly Met Pro Val Ile Ser Lys Glu Glu Ala Val Asp Met Asp Phe
355 360 365

Phe Thr Lys Gln Ile Ile Thr Gly Arg Asp Val His Pro Gly Leu Phe
370 375 380

Ala Asn Trp Phe Thr Gly Gly Leu Asn Tyr Gln Ile Glu His His Leu
385 390 395 400

Phe Pro Ser Met Pro Arg His Asn Phe Ser Lys Ile Gln Pro Ala Val
405 410 415

Glu Thr Leu Cys Lys Lys Tyr Asn Val Arg Tyr His Thr Thr Gly Met
420 425 430

Ile Glu Gly Thr Ala Glu Val Phe Ser Arg Leu Asn Glu Val Ser Lys
435 440 445

Ala Ala Ser Lys Met Gly Lys Ala Gln
450 455 



<210> 33 
<211> 3598 
<212> DNA 
<213> Unknown 



<220>
<223> Sequence constitutes a plant promoter-terminator expression cassette in vector pUC19

<400> 33 

tcgcgcgttt cggtgatgac ggtgaaaacc tctgacacat gcagctcccg gagacggtca 60 

cagcttgtct gtaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggtg 120 

ttggcgggtg tcggggctgg cttaactatg cggcatcaga gcagattgta ctgagagtgc 180 

accatatgcg gtgtgaaata ccgcacagat gcgtaaggag aaaataccgc atcaggcgcc 240 

attcgccatt caggctgcgc aactgttggg aagggcgatc ggtgcgggcc tcttcgctat 300 

tacgccagct ggcgaaaggg ggatgtgctg caaggcgatt aagttgggta acgccagggt 360 

tttcccagtc acgacgttgt aaaacgacgg ccagtgaatt cggcgcgccg agctcctcga 420 

gcaaatttac acattgccac taaacgtcta aacccttgta atttgttttt gttttactat 480 

gtgtgttatg tatttgattt gcgataaatt tttatatttg gtactaaatt tataacacct 540 

tttatgctaa cgtttgccaa cacttagcaa tttgcaagtt gattaattga ttctaaatta 600

tttttgtctt ctaaatacat atactaatca actggaaatg taaatatttg ctaatatttc 660 

tactatagga gaattaaagt gagtgaatat ggtaccacaa ggtttggaga tttaattgtt 720 

gcaatgctgc atggatggca tatacaccaa acattcaata attcttgagg ataataatgg 780 

taccacacaa gatttgaggt gcatgaacgt cacgtggaca aaaggtttag taatttttca 840 

agacaacaat gttaccacac acaagttttg aggtgcatgc atggatgccc tgtggaaagt 900 

ttaaaaatat tttggaaatg atttgcatgg aagccatgtg taaaaccatg acatccactt 960 

ggaggatgca ataatgaaga aaactacaaa tttacatgca actagttatg catgtagtct 1020 

atataatgag gattttgcaa tactttcatt catacacact cactaagttt tacacgatta 1080
taatttcttc atagccagcc caccgcggtg ggcggccgcc tgcagtctag aaggcctcct 1140



gctttaatga gatatgcgag acgcctatga tcgcatgata tttgctttca attctgttgt 1200 



gcacgttgta aaaaacctga gcatgtgtag ctcagatcct taccgccggt ttcggttcat 1260 



tctaatgaat atatcacccg ttactatcgt atttttatga ataatattct ccgttcaatt 1320 



tactgattgt ccgtcgacga attcgagctc ggcgcgccaa gcttggcgta atcatggtca 1380 



tagctgtttc ctgtgtgaaa ttgttatccg ctcacaattc cacacaacat acgagccgga 1440 
agcataaagt gtaaagcctg gggtgcctaa tgagtgagct aactcacatt aattgcgttg 1500 
cgctcactgc ccgctttcca gtcgggaaac ctgtcgtgcc agctgcatta atgaatcggc 1560 



caacgcgcgg ggagaggcgg tttgcgtatt gggcgctctt ccgcttcctc gctcactgac 1620 
tcgctgcgct cggtcgttcg gctgcggcga gcggtatcag ctcactcaaa ggcggtaata 1680 
cggttatcca cagaatcagg ggataacgca ggaaagaaca tgtgagcaaa aggccagcaa 1740 
aaggccagga accgtaaaaa ggccgcgttg ctggcgtttt tccataggct ccgcccccct 1800
gacgagcatc acaaaaatcg acgctcaagt cagaggtggc gaaacccgac aggactataa 1860 
agataccagg cgtttccccc tggaagctcc ctcgtgcgct ctcctgttcc gaccctgccg 1920 
cttaccggat acctgtccgc ctttctccct tcgggaagcg tggcgctttc tcatagctca 1980 
cgctgtaggt atctcagttc ggtgtaggtc gttcgctcca agctgggctg tgtgcacgaa 2040 
ccccccgttc agcccgaccg ctgcgcctta tccggtaact atcgtcttga gtccaacccg 2100 
gtaagacacg acttatcgcc actggcagca gccactggta acaggattag cagagcgagg 2160 
tatgtaggcg gtgctacaga gttcttgaag tggtggccta actacggcta cactagaagg 2220 
acagtatttg gtatctgcgc tctgctgaag ccagttacct tcggaaaaag agttggtagc 2280
tcttgatccg gcaaacaaac caccgctggt agcggtggtt tttttgtttg caagcagcag 2340
attacgcgca gaaaaaaagg atctcaagaa gatcctttga tcttttctac ggggtctgac 2400
gctcagtgga acgaaaactc acgttaaggg attttggtca tgagattatc aaaaaggatc 2460
ttcacctaga tccttttaaa ttaaaaatga agttttaaat caatctaaag tatatatgag 2520
taaacttggt ctgacagtta ccaatgctta atcagtgagg cacctatctc agcgatctgt 2580
ctatttcgtt catccatagt tgcctgactc cccgtcgtgt agataactac gatacgggag 2640
ggcttaccat ctggccccag tgctgcaatg ataccgcgag acccacgctc accggctcca 2700
gatttatcag caataaacca gccagccgga agggccgagc gcagaagtgg tcctgcaact 2760
ttatccgcct ccatccagtc tattaattgt tgccgggaag ctagagtaag tagttcgcca 2820
gttaatagtt tgcgcaacgt tgttgccatt gctacaggca tcgtggtgtc acgctcgtcg 2880
tttggtatgg cttcattcag ctccggttcc caacgatcaa ggcgagttac atgatccccc 2940 
atgttgtgca aaaaagcggt tagctccttc ggtcctccga tcgttgtcag aagtaagttg 3000 
gccgcagtgt tatcactcat ggttatggca gcactgcata attctcttac tgtcatgcca 3060 
tccgtaagat gcttttctgt gactggtgag tactcaacca agtcattctg agaatagtgt 3120 
atgcggcgac cgagttgctc ttgcccggcg tcaatacggg ataataccgc gccacatagc 3180 
agaactttaa aagtgctcat cattggaaaa cgttcttcgg ggcgaaaact ctcaaggatc 3240 
ttaccgctgt tgagatccag ttcgatgtaa cccactcgtg cacccaactg atcttcagca 3300 
tcttttactt tcaccagcgt ttctgggtga gcaaaaacag gaaggcaaaa tgccgcaaaa 3360 
aagggaataa gggcgacacg gaaatgttga atactcatac tcttcctttt tcaatattat 3420 
tgaagcattt atcagggtta ttgtctcatg agcggataca tatttgaatg tatttagaaa 3480
aataaacaaa taggggttcc gcgcacattt ccccgaaaag tgccacctga cgtctaagaa 3540
accattatta tcatgacatt aacctataaa aataggcgta tcacgaggcc ctttcgtc 3598 

<210> 34 
<211> 3590 
<212> DNA 
<213> Unknown 

<220>
<223> Sequence constitutes a plant promoter-terminator expression cassette in vector pUC19

<400> 34

tcgcgcgttt cggtgatgac ggtgaaaacc tctgacacat gcagctcccg gagacggtca 60
cagcttgtct gtaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggtg 120
ttggcgggtg tcggggctgg cttaactatg cggcatcaga gcagattgta ctgagagtgc 180 
accatatgcg gtgtgaaata ccgcacagat gcgtaaggag aaaataccgc atcaggcgcc 240 
attcgccatt caggctgcgc aactgttggg aagggcgatc ggtgcgggcc tcttcgctat 300 
tacgccagct ggcgaaaggg ggatgtgctg caaggcgatt aagttgggta acgccagggt 360 
tttcccagtc acgacgttgt aaaacgacgg ccagtgaatt cggcgcgccg agctcctcga 420 
gcaaatttac acattgccac taaacgtcta aacccttgta atttgttttt gttttactat 480 
gtgtgttatg tatttgattt gcgataaatt tttatatttg gtactaaatt tataacacct 540 
tttatgctaa cgtttgccaa cacttagcaa tttgcaagtt gattaattga ttctaaatta 600 
tttttgtctt ctaaatacat atactaatca actggaaatg taaatatttg ctaatatttc 660 
tactatagga gaattaaagt gagtgaatat ggtaccacaa ggtttggaga tttaattgtt 720
gcaatgctgc atggatggca tatacaccaa acattcaata attcttgagg ataataatgg 780
taccacacaa gatttgaggt gcatgaacgt cacgtggaca aaaggtttag taatttttca 840 
agacaacaat gttaccacac acaagttttg aggtgcatgc atggatgccc tgtggaaagt 900 
ttaaaaatat tttggaaatg atttgcatgg aagccatgtg taaaaccatg acatccactt 960 
ggaggatgca ataatgaaga aaactacaaa tttacatgca actagttatg catgtagtct 1020 
atataatgag gattttgcaa tactttcatt catacacact cactaagttt tacacgatta 1080 
taatttcttc atagccagcg gatccgatat cgggcccgct agcgttaacc ctgctttaat 1140
gagatatgcg agacgcctat gatcgcatga tatttgcttt caattctgtt gtgcacgttg 1200
taaaaaacct gagcatgtgt agctcagatc cttaccgccg gtttcggttc attctaatga 1260
atatatcacc cgttactatc gtatttttat gaataatatt ctccgttcaa tttactgatt 1320
gtccgtcgac gaattcgagc tcggcgcgcc aagcttggcg taatcatggt catagctgtt 1380
tcctgtgtga aattgttatc cgctcacaat tccacacaac atacgagccg gaagcataaa 1440
gtgtaaagcc tggggtgcct aatgagtgag ctaactcaca ttaattgcgt tgcgctcact 1500 
gcccgctttc cagtcgggaa acctgtcgtg ccagctgcat taatgaatcg gccaacgcgc 1560 
ggggagaggc ggtttgcgta ttgggcgctc ttccgcttcc tcgctcactg actcgctgcg 1620 
ctcggtcgtt cggctgcggc gagcggtatc agctcactca aaggcggtaa tacggttatc 1680 
cacagaatca ggggataacg caggaaagaa catgtgagca aaaggccagc aaaaggccag 1740 
gaaccgtaaa aaggccgcgt tgctggcgtt tttccatagg ctccgccccc ctgacgagca 1800
tcacaaaaat cgacgctcaa gtcagaggtg gcgaaacccg acaggactat aaagatacca 1860
ggcgtttccc cctggaagct ccctcgtgcg ctctcctgtt ccgaccctgc cgcttaccgg 1920
atacctgtcc gcctttctcc cttcgggaag cgtggcgctt tctcatagct cacgctgtag 1980

gtatctcagt tcggtgtagg tcgttcgctc caagctgggc tgtgtgcacg aaccccccgt 2040

tcagcccgac cgctgcgcct tatccggtaa ctatcgtctt gagtccaacc cggtaagaca 2100 

cgacttatcg ccactggcag cagccactgg taacaggatt agcagagcga ggtatgtagg 2160 

cggtgctaca gagttcttga agtggtggcc taactacggc tacactagaa ggacagtatt 2220 

tggtatctgc gctctgctga agccagttac cttcggaaaa agagttggta gctcttgatc 2280 

cggcaaacaa accaccgctg gtagcggtgg tttttttgtt tgcaagcagc agattacgcg 2340 

cagaaaaaaa ggatctcaag aagatccttt gatcttttct acggggtctg acgctcagtg 2400

gaacgaaaac tcacgttaag ggattttggt catgagatta tcaaaaagga tcttcaccta 2460 

gatcctttta aattaaaaat gaagttttaa atcaatctaa agtatatatg agtaaacttg 2520 

gtctgacagt taccaatgct taatcagtga ggcacctatc tcagcgatct gtctatttcg 2580 

ttcatccata gttgcctgac tccccgtcgt gtagataact acgatacggg agggcttacc 2640

atctggcccc agtgctgcaa tgataccgcg agacccacgc tcaccggctc cagatttatc 2700 

agcaataaac cagccagccg gaagggccga gcgcagaagt ggtcctgcaa ctttatccgc 2760 

ctccatccag tctattaatt gttgccggga agctagagta agtagttcgc cagttaatag 2820

tttgcgcaac gttgttgcca ttgctacagg catcgtggtg tcacgctcgt cgtttggtat 2880 

ggcttcattc agctccggtt cccaacgatc aaggcgagtt acatgatccc ccatgttgtg 2940 

caaaaaagcg gttagctcct tcggtcctcc gatcgttgtc agaagtaagt tggccgcagt 3000 

gttatcactc atggttatgg cagcactgca taattctctt actgtcatgc catccgtaag 3060 
atgcttttct gtgactggtg agtactcaac caagtcattc tgagaatagt gtatgcggcg 3120
accgagttgc tcttgcccgg cgtcaatacg ggataatacc gcgccacata gcagaacttt 3180
aaaagtgctc atcattggaa aacgttcttc ggggcgaaaa ctctcaagga tcttaccgct 3240 
gttgagatcc agttcgatgt aacccactcg tgcacccaac tgatcttcag catcttttac 3300 
tttcaccagc gtttctgggt gagcaaaaac aggaaggcaa aatgccgcaa aaaagggaat 3360 
aagggcgaca cggaaatgtt gaatactcat actcttcctt tttcaatatt attgaagcat 3420 
ttatcagggt tattgtctca tgagcggata catatttgaa tgtatttaga aaaataaaca 3480 
aataggggtt ccgcgcacat ttccccgaaa agtgccacct gacgtctaag aaaccattat 3540 
tatcatgaca ttaacctata aaaataggcg tatcacgagg ccctttcgtc 3590 

<210> 35 
<211> 3584 
<212> DNA 
<213> Unknown 

<220>
<223> Sequence constitutes a plant promoter-terminator expression cassette in vector pUC19

<400> 35 

tcgcgcgttt cggtgatgac ggtgaaaacc tctgacacat gcagctcccg gagacggtca 60 
cagcttgtct gtaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggtg 120 
ttggcgggtg tcggggctgg cttaactatg cggcatcaga gcagattgta ctgagagtgc 180 
accatatgcg gtgtgaaata ccgcacagat gcgtaaggag aaaataccgc atcaggcgcc 240
attcgccatt caggctgcgc aactgttggg aagggcgatc ggtgcgggcc tcttcgctat 300
tacgccagct ggcgaaaggg ggatgtgctg caaggcgatt aagttgggta acgccagggt 360
tttcccagtc acgacgttgt aaaacgacgg ccagtgaatt cggcgcgccg agctcctcga 420

gcaaatttac acattgccac taaacgtcta aacccttgta atttgttttt gttttactat 480

gtgtgttatg tatttgattt gcgataaatt tttatatttg gtactaaatt tataacacct 540 

tttatgctaa cgtttgccaa cacttagcaa tttgcaagtt gattaattga ttctaaatta 600 

tttttgtctt ctaaatacat atactaatca actggaaatg taaatatttg ctaatatttc 660

tactatagga gaattaaagt gagtgaatat ggtaccacaa ggtttggaga tttaattgtt 720

gcaatgctgc atggatggca tatacaccaa acattcaata attcttgagg ataataatgg 780

taccacacaa gatttgaggt gcatgaacgt cacgtggaca aaaggtttag taatttttca 840

agacaacaat gttaccacac acaagttttg aggtgcatgc atggatgccc tgtggaaagt 900 

ttaaaaatat tttggaaatg atttgcatgg aagccatgtg taaaaccatg acatccactt 960 

ggaggatgca ataatgaaga aaactacaaa tttacatgca actagttatg catgtagtct 1020
atataatgag gattttgcaa tactttcatt catacacact cactaagttt tacacgatta 1080
taatttcttc atagccagca gatctgccgg catcgatccc gggccatggc ctgctttaat 1140 
gagatatgcg agacgcctat gatcgcatga tatttgcttt caattctgtt gtgcacgttg 1200 
taaaaaacct gagcatgtgt agctcagatc cttaccgccg gtttcggttc attctaatga 1260 
atatatcacc cgttactatc gtatttttat gaataatatt ctccgttcaa tttactgatt 1320 
gtccgtcgac gagctcggcg cgccaagctt ggcgtaatca tggtcatagc tgtttcctgt 1380 
gtgaaattgt tatccgctca caattccaca caacatacga gccggaagca taaagtgtaa 1440 
agcctggggt gcctaatgag tgagctaact cacattaatt gcgttgcgct cactgcccgc 1500 
tttccagtcg ggaaacctgt cgtgccagct gcattaatga atcggccaac gcgcggggag 1560
aggcggtttg cgtattgggc gctcttccgc ttcctcgctc actgactcgc tgcgctcggt 1620
cgttcggctg cggcgagcgg tatcagctca ctcaaaggcg gtaatacggt tatccacaga 1680 
atcaggggat aacgcaggaa agaacatgtg agcaaaaggc cagcaaaagg ccaggaaccg 1740 
taaaaaggcc gcgttgctgg cgtttttcca taggctccgc ccccctgacg agcatcacaa 1800 
aaatcgacgc tcaagtcaga ggtggcgaaa cccgacagga ctataaagat accaggcgtt 1860 
tccccctgga agctccctcg tgcgctctcc tgttccgacc ctgccgctta ccggatacct 1920
gtccgccttt ctcccttcgg gaagcgtggc gctttctcat agctcacgct gtaggtatct 1980 
cagttcggtg taggtcgttc gctccaagct gggctgtgtg cacgaacccc ccgttcagcc 2040 
cgaccgctgc gccttatccg gtaactatcg tcttgagtcc aacccggtaa gacacgactt 2100 
atcgccactg gcagcagcca ctggtaacag gattagcaga gcgaggtatg taggcggtgc 2160 
tacagagttc ttgaagtggt ggcctaacta cggctacact agaaggacag tatttggtat 2220 
ctgcgctctg ctgaagccag ttaccttcgg aaaaagagtt ggtagctctt gatccggcaa 2280
acaaaccacc gctggtagcg gtggtttttt tgtttgcaag cagcagatta cgcgcagaaa 2340 
aaaaggatct caagaagatc ctttgatctt ttctacgggg tctgacgctc agtggaacga 2400 
aaactcacgt taagggattt tggtcatgag attatcaaaa aggatcttca cctagatcct 2460 
tttaaattaa aaatgaagtt ttaaatcaat ctaaagtata tatgagtaaa cttggtctga 2520 
cagttaccaa tgcttaatca gtgaggcacc tatctcagcg atctgtctat ttcgttcatc 2580 
catagttgcc tgactccccg tcgtgtagat aactacgata cgggagggct taccatctgg 2640 
ccccagtgct gcaatgatac cgcgagaccc acgctcaccg gctccagatt tatcagcaat 2700
aaaccagcca gccggaaggg ccgagcgcag aagtggtcct gcaactttat ccgcctccat 2760
ccagtctatt aattgttgcc gggaagctag agtaagtagt tcgccagtta atagtttgcg 2820
caacgttgtt gccattgcta caggcatcgt ggtgtcacgc tcgtcgtttg gtatggcttc 2880
attcagctcc ggttcccaac gatcaaggcg agttacatga tcccccatgt tgtgcaaaaa 2940 
agcggttagc tccttcggtc ctccgatcgt tgtcagaagt aagttggccg cagtgttatc 3000
actcatggtt atggcagcac tgcataattc tcttactgtc atgccatccg taagatgctt 3060
ttctgtgact ggtgagtact caaccaagtc attctgagaa tagtgtatgc ggcgaccgag 3120
ttgctcttgc ccggcgtcaa tacgggataa taccgcgcca catagcagaa ctttaaaagt 3180
gctcatcatt ggaaaacgtt cttcggggcg aaaactctca aggatcttac cgctgttgag 3240
atccagttcg atgtaaccca ctcgtgcacc caactgatct tcagcatctt ttactttcac 3300
cagcgtttct gggtgagcaa aaacaggaag gcaaaatgcc gcaaaaaagg gaataagggc 3360
gacacggaaa tgttgaatac tcatactctt cctttttcaa tattattgaa gcatttatca 3420
gggttattgt ctcatgagcg gatacatatt tgaatgtatt tagaaaaata aacaaatagg 3480
ggttccgcgc acatttcccc gaaaagtgcc acctgacgtc taagaaacca ttattatcat 3540 
gacattaacc tataaaaata ggcgtatcac gaggcccttt cgtc 3584 

<210> 36 
<211> 4507 
<212> DNA 
<213> Unknown 

<220>
<223> Sequence constitutes a plant promoter-terminator expression cassette in vector pUC19

<400> 36

tcgcgcgttt cggtgatgac ggtgaaaacc tctgacacat gcagctcccg gagacggtca 60 

cagcttgtct gtaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggtg 120 

ttggcgggtg tcggggctgg cttaactatg cggcatcaga gcagattgta ctgagagtgc 180 

accatatgcg gtgtgaaata ccgcacagat gcgtaaggag aaaataccgc atcaggcgcc 240 

attcgccatt caggctgcgc aactgttggg aagggcgatc ggtgcgggcc tcttcgctat 300 

tacgccagct ggcgaaaggg ggatgtgctg caaggcgatt aagttgggta acgccagggt 360 

tttcccagtc acgacgttgt aaaacgacgg ccagtgaatt cggcgcgccg agctcctcga 420 

gcaaatttac acattgccac taaacgtcta aacccttgta atttgttttt gttttactat 480 

gtgtgttatg tatttgattt gcgataaatt tttatatttg gtactaaatt tataacacct 540 

tttatgctaa cgtttgccaa cacttagcaa tttgcaagtt gattaattga ttctaaatta 600

tttttgtctt ctaaatacat atactaatca actggaaatg taaatatttg ctaatatttc 660 

tactatagga gaattaaagt gagtgaatat ggtaccacaa ggtttggaga tttaattgtt 720

gcaatgctgc atggatggca tatacaccaa acattcaata attcttgagg ataataatgg 780 

taccacacaa gatttgaggt gcatgaacgt cacgtggaca aaaggtttag taatttttca 840 

agacaacaat gttaccacac acaagttttg aggtgcatgc atggatgccc tgtggaaagt 900 

ttaaaaatat tttggaaatg atttgcatgg aagccatgtg taaaaccatg acatccactt 960 

ggaggatgca ataatgaaga aaactacaaa tttacatgca actagttatg catgtagtct 1020 

atataatgag gattttgcaa tactttcatt catacacact cactaagttt tacacgatta 1080 

taatttcttc atagccagcc caccgcggtg ggcggccgcc tgcagtctag aaggcctcct 1140 

gctttaatga gatatgcgag acgcctatga tcgcatgata tttgctttca attctgttgt 1200
gcacgttgta aaaaacctga gcatgtgtag ctcagatcct taccgccggt ttcggttcat 1260
tctaatgaat atatcacccg ttactatcgt atttttatga ataatattct ccgttcaatt 1320 
tactgattgt ccgtcgagca aatttacaca ttgccactaa acgtctaaac ccttgtaatt 1380 
tgtttttgtt ttactatgtg tgttatgtat ttgatttgcg ataaattttt atatttggta 1440 
ctaaatttat aacacctttt atgctaacgt ttgccaacac ttagcaattt gcaagttgat 1500 

taattgattc taaattattt ttgtcttcta aatacatata ctaatcaact ggaaatgtaa 1560
atatttgcta atatttctac tataggagaa ttaaagtgag tgaatatggt accacaaggt 1620
ttggagattt aattgttgca atgctgcatg gatggcatat acaccaaaca ttcaataatt 1680
cttgaggata ataatggtac cacacaagat ttgaggtgca tgaacgtcac gtggacaaaa 1740
ggtttagtaa tttttcaaga caacaatgtt accacacaca agttttgagg tgcatgcatg 1800
gatgccctgt ggaaagttta aaaatatttt ggaaatgatt tgcatggaag ccatgtgtaa 1860
aaccatgaca tccacttgga ggatgcaata atgaagaaaa ctacaaattt acatgcaact 1920

agttatgcat gtagtctata taatgaggat tttgcaatac tttcattcat acacactcac 1980 

taagttttac acgattataa tttcttcata gccagcggat ccgatatcgg gcccgctagc 2040

gttaaccctg ctttaatgag atatgcgaga cgcctatgat cgcatgatat ttgctttcaa 2100

ttctgttgtg cacgttgtaa aaaacctgag catgtgtagc tcagatcctt accgccggtt 2160

tcggttcatt ctaatgaata tatcacccgt tactatcgta tttttatgaa taatattctc 2220 

cgttcaattt actgattgtc cgtcgacgaa ttcgagctcg gcgcgccaag cttggcgtaa 2280 

tcatggtcat agctgtttcc tgtgtgaaat tgttatccgc tcacaattcc acacaacata 2340 

cgagccggaa gcataaagtg taaagcctgg ggtgcctaat gagtgagcta actcacatta 2400
attgcgttgc gctcactgcc cgctttccag tcgggaaacc tgtcgtgcca gctgcattaa 2460
tgaatcggcc aacgcgcggg gagaggcggt ttgcgtattg ggcgctcttc cgcttcctcg 2520 
ctcactgact cgctgcgctc ggtcgttcgg ctgcggcgag cggtatcagc tcactcaaag 2580 
gcggtaatac ggttatccac agaatcaggg gataacgcag gaaagaacat gtgagcaaaa 2640 
ggccagcaaa aggccaggaa ccgtaaaaag gccgcgttgc tggcgttttt ccataggctc 2700 
cgcccccctg acgagcatca caaaaatcga cgctcaagtc agaggtggcg aaacccgaca 2760 
ggactataaa gataccaggc gtttccccct ggaagctccc tcgtgcgctc tcctgttccg 2820
accctgccgc ttaccggata cctgtccgcc tttctccctt cgggaagcgt ggcgctttct 2880
catagctcac gctgtaggta tctcagttcg gtgtaggtcg ttcgctccaa gctgggctgt 2940
gtgcacgaac cccccgttca gcccgaccgc tgcgccttat ccggtaacta tcgtcttgag 3000
tccaacccgg taagacacga cttatcgcca ctggcagcag ccactggtaa caggattagc 3060
agagcgaggt atgtaggcgg tgctacagag ttcttgaagt ggtggcctaa ctacggctac 3120
actagaagga cagtatttgg tatctgcgct ctgctgaagc cagttacctt cggaaaaaga 3180 
gttggtagct cttgatccgg caaacaaacc accgctggta gcggtggttt ttttgtttgc 3240 
aagcagcaga ttacgcgcag aaaaaaagga tctcaagaag atcctttgat cttttctacg 3300 
gggtctgacg ctcagtggaa cgaaaactca cgttaaggga ttttggtcat gagattatca 3360 
aaaaggatct tcacctagat ccttttaaat taaaaatgaa gttttaaatc aatctaaagt 3420 
atatatgagt aaacttggtc tgacagttac caatgcttaa tcagtgaggc acctatctca 3480 
gcgatctgtc tatttcgttc atccatagtt gcctgactcc ccgtcgtgta gataactacg 3540 
atacgggagg gcttaccatc tggccccagt gctgcaatga taccgcgaga cccacgctca 3600
ccggctccag atttatcagc aataaaccag ccagccggaa gggccgagcg cagaagtggt 3660

cctgcaactt tatccgcctc catccagtct attaattgtt gccgggaagc tagagtaagt 3720 

agttcgccag ttaatagttt gcgcaacgtt gttgccattg ctacaggcat cgtggtgtca 3780 

cgctcgtcgt ttggtatggc ttcattcagc tccggttccc aacgatcaag gcgagttaca 3840 

tgatccccca tgttgtgcaa aaaagcggtt agctccttcg gtcctccgat cgttgtcaga 3900 

agtaagttgg ccgcagtgtt atcactcatg gttatggcag cactgcataa ttctcttact 3960 

gtcatgccat ccgtaagatg cttttctgtg actggtgagt actcaaccaa gtcattctga 4020

gaatagtgta tgcggcgacc gagttgctct tgcccggcgt caatacggga taataccgcg 4080

ccacatagca gaactttaaa agtgctcatc attggaaaac gttcttcggg gcgaaaactc 4140

tcaaggatct taccgctgtt gagatccagt tcgatgtaac ccactcgtgc acccaactga 4200

tcttcagcat cttttacttt caccagcgtt tctgggtgag caaaaacagg aaggcaaaat 4260
gccgcaaaaa agggaataag ggcgacacgg aaatgttgaa tactcatact cttccttttt 4320
caatattatt gaagcattta tcagggttat tgtctcatga gcggatacat atttgaatgt 4380 
atttagaaaa ataaacaaat aggggttccg cgcacatttc cccgaaaagt gccacctgac 4440 
gtctaagaaa ccattattat catgacatta acctataaaa ataggcgtat cacgaggccc 4500 
tttcgtc 4507 

<210> 37 
<211> 5410 
<212> DNA 
<213> Unknown 



<220>
<223> Sequence constitutes a plant promoter-terminator expression cassette in vector pUC19

<400> 37 

ttttggaaat gatttgcatg gaagccatgt gtaaaaccat gacatccact tggaggatgc 60 
aataatgaag aaaactacaa atttacatgc aactagttat gcatgtagtc tatataatga 120 
ggattttgca atactttcat tcatacacac tcactaagtt ttacacgatt ataatttctt 180 
catagccagc ggatccgata tcgggcccgc tagcgttaac cctgctttaa tgagatatgc 240 
gagacgccta tgatcgcatg atatttgctt tcaattctgt tgtgcacgtt gtaaaaaacc 300 
tgagcatgtg tagctcagat ccttaccgcc ggtttcggtt cattctaatg aatatatcac 360 
ccgttactat cgtattttta tgaataatat tctccgttca atttactgat tgtccgtcga 420 
gcaaatttac acattgccac taaacgtcta aacccttgta atttgttttt gttttactat 480 
gtgtgttatg tatttgattt gcgataaatt tttatatttg gtactaaatt tataacacct 540 
tttatgctaa cgtttgccaa cacttagcaa tttgcaagtt gattaattga ttctaaatta 600 
tttttgtctt ctaaatacat atactaatca actggaaatg taaatatttg ctaatatttc 660 
tactatagga gaattaaagt gagtgaatat ggtaccacaa ggtttggaga tttaattgtt 720 
gcaatgctgc atggatggca tatacaccaa acattcaata attcttgagg ataataatgg 780 
taccacacaa gatttgaggt gcatgaacgt cacgtggaca aaaggtttag taatttttca 840 
agacaacaat gttaccacac acaagttttg aggtgcatgc atggatgccc tgtggaaagt 900 
ttaaaaatat tttggaaatg atttgcatgg aagccatgtg taaaaccatg acatccactt 960 
ggaggatgca ataatgaaga aaactacaaa tttacatgca actagttatg catgtagtct 1020 
atataatgag gattttgcaa tactttcatt catacacact cactaagttt tacacgatta 1080
taatttcttc atagccagca gatctgccgg catcgatccc gggccatggc ctgctttaat 1140
gagatatgcg agacgcctat gatcgcatga tatttgcttt caattctgtt gtgcacgttg 1200 
taaaaaacct gagcatgtgt agctcagatc cttaccgccg gtttcggttc attctaatga 1260 
atatatcacc cgttactatc gtatttttat gaataatatt ctccgttcaa tttactgatt 1320
gtccgtcgac gagctcggcg cgccaagctt ggcgtaatca tggtcatagc tgtttcctgt 1380
gtgaaattgt tatccgctca caattccaca caacatacga gccggaagca taaagtgtaa 1440
agcctggggt gcctaatgag tgagctaact cacattaatt gcgttgcgct cactgcccgc 1500
tttccagtcg ggaaacctgt cgtgccagct gcattaatga atcggccaac gcgcggggag 1560
aggcggtttg cgtattgggc gctcttccgc ttcctcgctc actgactcgc tgcgctcggt 1620 
cgttcggctg cggcgagcgg tatcagctca ctcaaaggcg gtaatacggt tatccacaga 1680 
atcaggggat aacgcaggaa agaacatgtg agcaaaaggc cagcaaaagg ccaggaaccg 1740 
taaaaaggcc gcgttgctgg cgtttttcca taggctccgc ccccctgacg agcatcacaa 1800 
aaatcgacgc tcaagtcaga ggtggcgaaa cccgacagga ctataaagat accaggcgtt 1860 
tccccctgga agctccctcg tgcgctctcc tgttccgacc ctgccgctta ccggatacct 1920 
gtccgccttt ctcccttcgg gaagcgtggc gctttctcat agctcacgct gtaggtatct 1980 
cagttcggtg taggtcgttc gctccaagct gggctgtgtg cacgaacccc ccgttcagcc 2040 
cgaccgctgc gccttatccg gtaactatcg tcttgagtcc aacccggtaa gacacgactt 2100 
atcgccactg gcagcagcca ctggtaacag gattagcaga gcgaggtatg taggcggtgc 2160 
tacagagttc ttgaagtggt ggcctaacta cggctacact agaaggacag tatttggtat 2220 
ctgcgctctg ctgaagccag ttaccttcgg aaaaagagtt ggtagctctt gatccggcaa 2280
acaaaccacc gctggtagcg gtggtttttt tgtttgcaag cagcagatta cgcgcagaaa 2340
aaaaggatct caagaagatc ctttgatctt ttctacgggg tctgacgctc agtggaacga 2400 
aaactcacgt taagggattt tggtcatgag attatcaaaa aggatcttca cctagatcct 2460 
tttaaattaa aaatgaagtt ttaaatcaat ctaaagtata tatgagtaaa cttggtctga 2520 
cagttaccaa tgcttaatca gtgaggcacc tatctcagcg atctgtctat ttcgttcatc 2580 
catagttgcc tgactccccg tcgtgtagat aactacgata cgggagggct taccatctgg 2640 
ccccagtgct gcaatgatac cgcgagaccc acgctcaccg gctccagatt tatcagcaat 2700 
aaaccagcca gccggaaggg ccgagcgcag aagtggtcct gcaactttat ccgcctccat 2760 
ccagtctatt aattgttgcc gggaagctag agtaagtagt tcgccagtta atagtttgcg 2820 
caacgttgtt gccattgcta caggcatcgt ggtgtcacgc tcgtcgtttg gtatggcttc 2880
attcagctcc ggttcccaac gatcaaggcg agttacatga tcccccatgt tgtgcaaaaa 2940
agcggttagc tccttcggtc ctccgatcgt tgtcagaagt aagttggccg cagtgttatc 3000
actcatggtt atggcagcac tgcataattc tcttactgtc atgccatccg taagatgctt 3060
ttctgtgact ggtgagtact caaccaagtc attctgagaa tagtgtatgc ggcgaccgag 3120
ttgctcttgc ccggcgtcaa tacgggataa taccgcgcca catagcagaa ctttaaaagt 3180
gctcatcatt ggaaaacgtt cttcggggcg aaaactctca aggatcttac cgctgttgag 3240 
atccagttcg atgtaaccca ctcgtgcacc caactgatct tcagcatctt ttactttcac 3300 
cagcgtttct gggtgagcaa aaacaggaag gcaaaatgcc gcaaaaaagg gaataagggc 3360 
gacacggaaa tgttgaatac tcatactctt cctttttcaa tattattgaa gcatttatca 3420 
gggttattgt ctcatgagcg gatacatatt tgaatgtatt tagaaaaata aacaaatagg 3480




ggttccgcgc acatttcccc gaaaagtgcc acctgacgtc taagaaacca ttattatcat 3540 
gacattaacc tataaaaata ggcgtatcac gaggcccttt cgtctcgcgc gtttcggtga 3600 
tgacggtgaa aacctctgac acatgcagct cccggagacg gtcacagctt gtctgtaagc 3660 
ggatgccggg agcagacaag cccgtcaggg cgcgtcagcg ggtgttggcg ggtgtcgggg 3720
ctggcttaac tatgcggcat cagagcagat tgtactgaga gtgcaccata tgcggtgtga 3780 
aataccgcac agatgcgtaa ggagaaaata ccgcatcagg cgccattcgc cattcaggct 3840 
gcgcaactgt tgggaagggc gatcggtgcg ggcctcttcg ctattacgcc agctggcgaa 3900
agggggatgt gctgcaaggc gattaagttg ggtaacgcca gggttttccc agtcacgacg 3960
ttgtaaaacg acggccagtg aattcggcgc gccgagctcc tcgagcaaat ttacacattg 4020
ccactaaacg tctaaaccct tgtaatttgt ttttgtttta ctatgtgtgt tatgtatttg 4080 
atttgcgata aatttttata tttggtacta aatttataac accttttatg ctaacgtttg 4140 
ccaacactta gcaatttgca agttgattaa ttgattctaa attatttttg tcttctaaat 4200
acatatacta atcaactgga aatgtaaata tttgctaata tttctactat aggagaatta 4260 
aagtgagtga atatggtacc acaaggtttg gagatttaat tgttgcaatg ctgcatggat 4320 
ggcatataca ccaaacattc aataattctt gaggataata atggtaccac acaagatttg 4380 
aggtgcatga acgtcacgtg gacaaaaggt ttagtaattt ttcaagacaa caatgttacc 4440 
acacacaagt tttgaggtgc atgcatggat gccctgtgga aagtttaaaa atattttgga 4500
aatgatttgc atggaagcca tgtgtaaaac catgacatcc acttggagga tgcaataatg 4560
aagaaaacta caaatttaca tgcaactagt tatgcatgta gtctatataa tgaggatttt 4620
gcaatacttt cattcataca cactcactaa gttttacacg attataattt cttcatagcc 4680
agcccaccgc ggtgggcggc cgcctgcagt ctagaaggcc tcctgcttta atgagatatg 4740

cgagacgcct atgatcgcat gatatttgct ttcaattctg ttgtgcacgt tgtaaaaaac 4800

ctgagcatgt gtagctcaga tccttaccgc cggtttcggt tcattctaat gaatatatca 4860

cccgttacta tcgtattttt atgaataata ttctccgttc aatttactga ttgtccgtcg 4920

agcaaattta cacattgcca ctaaacgtct aaacccttgt aatttgtttt tgttttacta 4980

tgtgtgttat gtatttgatt tgcgataaat ttttatattt ggtactaaat ttataacacc 5040 

ttttatgcta acgtttgcca acacttagca atttgcaagt tgattaattg attctaaatt 5100 

atttttgtct tctaaataca tatactaatc aactggaaat gtaaatattt gctaatattt 5160 

ctactatagg agaattaaag tgagtgaata tggtaccaca aggtttggag atttaattgt 5220

tgcaatgctg catggatggc atatacacca aacattcaat aattcttgag gataataatg 5280 

gtaccacaca agatttgagg tgcatgaacg tcacgtggac aaaaggttta gtaatttttc 5340 

aagacaacaa tgttaccaca cacaagtttt gaggtgcatg catggatgcc ctgtggaaag 5400
tttaaaaata 5410 

<210> 38 
<211> 12093 
<212> DNA 
<213> Unknown 

<220>
<223> Plant expression vector with a promoter-terminator expression cassette

<400> 38 

gatctggcgc cggccagcga gacgagcaag attggccgcc gcccgaaacg atccgacagc 60
gcgcccagca caggtgcgca ggcaaattgc accaacgcat acagcgccag cagaatgcca 120
tagtgggcgg tgacgtcgtt cgagtgaacc agatcgcgca ggaggcccgg cagcaccggc 180 
ataatcaggc cgatgccgac agcgtcgagc gcgacagtgc tcagaattac gatcaggggt 240 
atgttgggtt tcacgtctgg cctccggacc agcctccgct ggtccgattg aacgcgcgga 300 
ttctttatca ctgataagtt ggtggacata ttatgtttat cagtgataaa gtgtcaagca 360 
tgacaaagtt gcagccgaat acagtgatcc gtgccgccct ggacctgttg aacgaggtcg 420 
gcgtagacgg tctgacgaca cgcaaactgg cggaacggtt gggggttcag cagccggcgc 480
tttactggca cttcaggaac aagcgggcgc tgctcgacgc actggccgaa gccatgctgg 540
cggagaatca tacgcattcg gtgccgagag ccgacgacga ctggcgctca tttctgatcg 600
ggaatgcccg cagcttcagg caggcgctgc tcgcctaccg cgatggcgcg cgcatccatg 660
ccggcacgcg accgggcgca ccgcagatgg aaacggccga cgcgcagctt cgcttcctct 720
gcgaggcggg tttttcggcc ggggacgccg tcaatgcgct gatgacaatc agctacttca 780
ctgttggggc cgtgcttgag gagcaggccg gcgacagcga tgccggcgag cgcggcggca 840 
ccgttgaaca ggctccgctc tcgccgctgt tgcgggccgc gatagacgcc ttcgacgaag 900 
ccggtccgga cgcagcgttc gagcagggac tcgcggtgat tgtcgatgga ttggcgaaaa 960 
ggaggctcgt tgtcaggaac gttgaaggac cgagaaaggg tgacgattga tcaggaccgc 1020 
tgccggagcg caacccactc actacagcag agccatgtag acaacatccc ctcccccttt 1080 
ccaccgcgtc agacgcccgt agcagcccgc tacgggcttt ttcatgccct gccctagcgt 1140 
ccaagcctca cggccgcgct cggcctctct ggcggccttc tggcgctctt ccgcttcctc 1200 
gctcactgac tcgctgcgct cggtcgttcg gctgcggcga gcggtatcag ctcactcaaa 1260
ggcggtaata cggttatcca cagaatcagg ggataacgca ggaaagaaca tgtgagcaaa 1320
aggccagcaa aaggccagga accgtaaaaa ggccgcgttg ctggcgtttt tccataggct 1380 
ccgcccccct gacgagcatc acaaaaatcg acgctcaagt cagaggtggc gaaacccgac 1440
aggactataa agataccagg cgtttccccc tggaagctcc ctcgtgcgct ctcctgttcc 1500 
gaccctgccg cttaccggat acctgtccgc ctttctccct tcgggaagcg tggcgctttt 1560 
ccgctgcata accctgcttc ggggtcatta tagcgatttt ttcggtatat ccatcctttt 1620
tcgcacgata tacaggattt tgccaaaggg ttcgtgtaga ctttccttgg tgtatccaac 1680 

ggcgtcagcc gggcaggata ggtgaagtag gcccacccgc gagcgggtgt tccttcttca 1740
ctgtccctta ttcgcacctg gcggtgctca acgggaatcc tgctctgcga ggctggccgg 1800
ctaccgccgg cgtaacagat gagggcaagc ggatggctga tgaaaccaag ccaaccagga 1860
agggcagccc acctatcaag gtgtactgcc ttccagacga acgaagagcg attgaggaaa 1920
aggcggcggc ggccggcatg agcctgtcgg cctacctgct ggccgtcggc cagggctaca 1980 
aaatcacggg cgtcgtggac tatgagcacg tccgcgagct ggcccgcatc aatggcgacc 2040 
tgggccgcct gggcggcctg ctgaaactct ggctcaccga cgacccgcgc acggcgcggt 2100 
tcggtgatgc cacgatcctc gccctgctgg cgaagatcga agagaagcag gacgagcttg 2160 
gcaaggtcat gatgggcgtg gtccgcccga gggcagagcc atgacttttt tagccgctaa 2220 
aacggccggg gggtgcgcgt gattgccaag cacgtcccca tgcgctccat caagaagagc 2280 
gacttcgcgg agctggtgaa gtacatcacc gacgagcaag gcaagaccga gcgcctttgc 2340 
gacgctcacc gggctggttg ccctcgccgc tgggctggcg gccgtctatg gccctgcaaa 2400
cgcgccagaa acgccgtcga agccgtgtgc gagacaccgc ggccgccggc gttgtggata 2460
cctcgcggaa aacttggccc tcactgacag atgaggggcg gacgttgaca cttgaggggc 2520
cgactcaccc ggcgcggcgt tgacagatga ggggcaggct cgatttcggc cggcgacgtg 2580
gagctggcca gcctcgcaaa tcggcgaaaa cgcctgattt tacgcgagtt tcccacagat 2640
gatgtggaca agcctgggga taagtgccct gcggtattga cacttgaggg gcgcgactac 2700
tgacagatga ggggcgcgat ccttgacact tgaggggcag agtgctgaca gatgaggggc 2760 
gcacctattg acatttgagg ggctgtccac aggcagaaaa tccagcattt gcaagggttt 2820 
ccgcccgttt ttcggccacc gctaacctgt cttttaacct gcttttaaac caatatttat 2880 
aaaccttgtt tttaaccagg gctgcgccct gtgcgcgtga ccgcgcacgc cgaagggggg 2940
tgccccccct tctcgaaccc tcccggcccg ctaacgcggg cctcccatcc ccccaggggc 3000
tgcgcccctc ggccgcgaac ggcctcaccc caaaaatggc agcgctggca gtccttgcca 3060
ttgccgggat cggggcagta acgggatggg cgatcagccc gagcgcgacg cccggaagca 3120
ttgacgtgcc gcaggtgctg gcatcgacat tcagcgacca ggtgccgggc agtgagggcg 3180
gcggcctggg tggcggcctg cccttcactt cggccgtcgg ggcattcacg gacttcatgg 3240 
cggggccggc aatttttacc ttgggcattc ttggcatagt ggtcgcgggt gccgtgctcg 3300 
tgttcggggg tgcgataaac ccagcgaacc atttgaggtg ataggtaaga ttataccgag 3360 
gtatgaaaac gagaattgga cctttacaga attactctat gaagcgccat atttaaaaag 3420 
ctaccaagac gaagaggatg aagaggatga ggaggcagat tgccttgaat atattgacaa 3480 
tactgataag ataatatatc ttttatatag aagatatcgc cgtatgtaag gatttcaggg 3540 
ggcaaggcat aggcagcgcg cttatcaata tatctataga atgggcaaag cataaaaact 3600 
tgcatggact aatgcttgaa acccaggaca ataaccttat agcttgtaaa ttctatcata 3660
attgggtaat gactccaact tattgatagt gttttatgtt cagataatgc ccgatgactt 3720
tgtcatgcag ctccaccgat tttgagaacg acagcgactt ccgtcccagc cgtgccaggt 3780 
gctgcctcag attcaggtta tgccgctcaa ttcgctgcgt atatcgcttg ctgattacgt 3840 
gcagctttcc cttcaggcgg gattcataca gcggccagcc atccgtcatc catatcacca 3900 
cgtcaaaggg tgacagcagg ctcataagac gccccagcgt cgccatagtg cgttcaccga 3960 
atacgtgcgc aacaaccgtc ttccggagac tgtcatacgc gtaaaacagc cagcgctggc 4020 
gcgatttagc cccgacatag ccccactgtt cgtccatttc cgcgcagacg atgacgtcac 4080 
tgcccggctg tatgcgcgag gttaccgact gcggcctgag ttttttaagt gacgtaaaat 4140 
cgtgttgagg ccaacgccca taatgcgggc tgttgcccgg catccaacgc cattcatggc 4200 
catatcaatg attttctggt gcgtaccggg ttgagaagcg gtgtaagtga actgcagttg 4260 
ccatgtttta cggcagtgag agcagagata gcgctgatgt ccggcggtgc ttttgccgtt 4320 
acgcaccacc ccgtcagtag ctgaacagga gggacagctg atagacacag aagccactgg 4380 
agcacctcaa aaacaccatc atacactaaa tcagtaagtt ggcagcatca cccataattg 4440
tggtttcaaa atcggctccg tcgatactat gttatacgcc aactttgaaa acaactttga 4500 
aaaagctgtt ttctggtatt taaggtttta gaatgcaagg aacagtgaat tggagttcgt 4560 
cttgttataa ttagcttctt ggggtatctt taaatactgt agaaaagagg aaggaaataa 4620 
taaatggcta aaatgagaat atcaccggaa ttgaaaaaac tgatcgaaaa ataccgctgc 4680 
gtaaaagata cggaaggaat gtctcctgct aaggtatata agctggtggg agaaaatgaa 4740 
aacctatatt taaaaatgac ggacagccgg tataaaggga ccacctatga tgtggaacgg 4800
gaaaaggaca tgatgctatg gctggaagga aagctgcctg ttccaaaggt cctgcacttt 4860
gaacggcatg atggctggag caatctgctc atgagtgagg ccgatggcgt cctttgctcg 4920
gaagagtatg aagatgaaca aagccctgaa aagattatcg agctgtatgc ggagtgcatc 4980
aggctctttc actccatcga catatcggat tgtccctata cgaatagctt agacagccgc 5040
ttagccgaat tggattactt actgaataac gatctggccg atgtggattg cgaaaactgg 5100
gaagaagaca ctccatttaa agatccgcgc gagctgtatg attttttaaa gacggaaaag 5160
cccgaagagg aacttgtctt ttcccacggc gacctgggag acagcaacat ctttgtgaaa 5220
gatggcaaag taagtggctt tattgatctt gggagaagcg gcagggcgga caagtggtat 5280 
gacattgcct tctgcgtccg gtcgatcagg gaggatatcg gggaagaaca gtatgtcgag 5340 
ctattttttg acttactggg gatcaagcct gattgggaga aaataaaata ttatatttta 5400 
ctggatgaat tgttttagta cctagatgtg gcgcaacgat gccggcgaca agcaggagcg 5460 
caccgacttc ttccgcatca agtgttttgg ctctcaggcc gaggcccacg gcaagtattt 5520 
gggcaagggg tcgctggtat tcgtgcaggg caagattcgg aataccaagt acgagaagga 5580 
cggccagacg gtctacggga ccgacttcat tgccgataag gtggattatc tggacaccaa 5640
ggcaccaggc gggtcaaatc aggaataagg gcacattgcc ccggcgtgag tcggggcaat 5700 
cccgcaagga gggtgaatga atcggacgtt tgaccggaag gcatacaggc aagaactgat 5760 
cgacgcgggg ttttccgccg aggatgccga aaccatcgca agccgcaccg tcatgcgtgc 5820 
gccccgcgaa accttccagt ccgtcggctc gatggtccag caagctacgg ccaagatcga 5880 
gcgcgacagc gtgcaactgg ctccccctgc cctgcccgcg ccatcggccg ccgtggagcg 5940 
ttcgcgtcgt ctcgaacagg aggcggcagg tttggcgaag tcgatgacca tcgacacgcg 6000 
aggaactatg acgaccaaga agcgaaaaac cgccggcgag gacctggcaa aacaggtcag 6060
cgaggccaag caggccgcgt tgctgaaaca cacgaagcag cagatcaagg aaatgcagct 6120
ttccttgttc gatattgcgc cgtggccgga cacgatgcga gcgatgccaa acgacacggc 6180 
ccgctctgcc ctgttcacca cgcgcaacaa gaaaatcccg cgcgaggcgc tgcaaaacaa 6240 
ggtcattttc cacgtcaaca aggacgtgaa gatcacctac accggcgtcg agctgcgggc 6300 
cgacgatgac gaactggtgt ggcagcaggt gttggagtac gcgaagcgca cccctatcgg 6360
cgagccgatc accttcacgt tctacgagct ttgccaggac ctgggctggt cgatcaatgg 6420 
ccggtattac acgaaggccg aggaatgcct gtcgcgccta caggcgacgg cgatgggctt 6480
cacgtccgac cgcgttgggc acctggaatc ggtgtcgctg ctgcaccgct tccgcgtcct 6540
ggaccgtggc aagaaaacgt cccgttgcca ggtcctgatc gacgaggaaa tcgtcgtgct 6600
gtttgctggc gaccactaca cgaaattcat atgggagaag taccgcaagc tgtcgccgac 6660
ggcccgacgg atgttcgact atttcagctc gcaccgggag ccgtacccgc tcaagctgga 6720
aaccttccgc ctcatgtgcg gatcggattc cacccgcgtg aagaagtggc gcgagcaggt 6780
cggcgaagcc tgcgaagagt tgcgaggcag cggcctggtg gaacacgcct gggtcaatga 6840 
tgacctggtg cattgcaaac gctagggcct tgtggggtca gttccggctg ggggttcagc 6900 
agccagcgct ttactggcat ttcaggaaca agcgggcact gctcgacgca cttgcttcgc 6960 
tcagtatcgc tcgggacgca cggcgcgctc tacgaactgc cgataaacag aggattaaaa 7020
ttgacaattg tgattaaggc tcagattcga cggcttggag cggccgacgt gcaggatttc 7080 
cgcgagatcc gattgtcggc cctgaagaaa gctccagaga tgttcgggtc cgtttacgag 7140 
cacgaggaga aaaagcccat ggaggcgttc gctgaacggt tgcgagatgc cgtggcattc 7200 
ggcgcctaca tcgacggcga gatcattggg ctgtcggtct tcaaacagga ggacggcccc 7260
aaggacgctc acaaggcgca tctgtccggc gttttcgtgg agcccgaaca gcgaggccga 7320
ggggtcgccg gtatgctgct gcgggcgttg ccggcgggtt tattgctcgt gatgatcgtc 7380
cgacagattc caacgggaat ctggtggatg cgcatcttca tcctcggcgc acttaatatt 7440
tcgctattct ggagcttgtt gtttatttcg gtctaccgcc tgccgggcgg ggtcgcggcg 7500 
acggtaggcg ctgtgcagcc gctgatggtc gtgttcatct ctgccgctct gctaggtagc 7560 
ccgatacgat tgatggcggt cctgggggct atttgcggaa ctgcgggcgt ggcgctgttg 7620 
gtgttgacac caaacgcagc gctagatcct gtcggcgtcg cagcgggcct ggcgggggcg 7680 
gtttccatgg cgttcggaac cgtgctgacc cgcaagtggc aacctcccgt gcctctgctc 7740 
acctttaccg cctggcaact ggcggccgga ggacttctgc tcgttccagt agctttagtg 7800
tttgatccgc caatcccgat gcctacagga accaatgttc tcggcctggc gtggctcggc 7860 
ctgatcggag cgggtttaac ctacttcctt tggttccggg ggatctcgcg actcgaacct 7920 
acagttgttt ccttactggg ctttctcagc cccagatctg gggtcgatca gccggggatg 7980 
catcaggccg acagtcggaa cttcgggtcc ccgacctgta ccattcggtg agcaatggat 8040
aggggagttg atatcgtcaa cgttcacttc taaagaaata gcgccactca gcttcctcag 8100 
cggctttatc cagcgatttc ctattatgtc ggcatagttc tcaagatcga cagcctgtca 8160 
cggttaagcg agaaatgaat aagaaggctg ataattcgga tctctgcgag ggagatgata 8220 
tttgatcaca ggcagcaacg ctctgtcatc gttacaatca acatgctacc ctccgcgaga 8280 
tcatccgtgt ttcaaacccg gcagcttagt tgccgttctt ccgaatagca tcggtaacat 8340
gagcaaagtc tgccgcctta caacggctct cccgctgacg ccgtcccgga ctgatgggct 8400
gcctgtatcg agtggtgatt ttgtgccgag ctgccggtcg gggagctgtt ggctggctgg 8460





tggcaggata tattgtggtg taaacaaatt gacgcttaga caacttaata acacattgcg 8520 
gacgttttta atgtactggg gtggtttttc ttttcaccag tgagacgggc aacagctgat 8580 
tgcccttcac cgcctggccc tgagagagtt gcagcaagcg gtccacgctg gtttgcccca 8640 
gcaggcgaaa atcctgtttg atggtggttc cgaaatcggc aaaatccctt ataaatcaaa 8700 
agaatagccc gagatagggt tgagtgttgt tccagtttgg aacaagagtc cactattaaa 8760 
gaacgtggac tccaacgtca aagggcgaaa aaccgtctat cagggcgatg gcccactacg 8820 
tgaaccatca cccaaatcaa gttttttggg gtcgaggtgc cgtaaagcac taaatcggaa 8880 
ccctaaaggg agcccccgat ttagagcttg acggggaaag ccggcgaacg tggcgagaaa 8940
ggaagggaag aaagcgaaag gagcgggcgc cattcaggct gcgcaactgt tgggaagggc 9000
gatcggtgcg ggcctcttcg ctattacgcc agctggcgaa agggggatgt gctgcaaggc 9060
gattaagttg ggtaacgcca gggttttccc agtcacgacg ttgtaaaacg acggccagtg 9120
aattaattcc catcttgaaa gaaatatagt ttaaatattt attgataaaa taacaagtca 9180 
ggtattatag tccaagcaaa aacataaatt tattgatgca agtttaaatt cagaaatatt 9240 
tcaataactg attatatcag ctggtacatt gccgtagatg aaagactgag tgcgatatta 9300 
tgtgtaatac ataaattgat gatatagcta gcttagctca tcgggggatc cgtcgaagct 9360 
agcttgggtc ccgctcagaa gaactcgtca agaaggcgat agaaggcgat gcgctgcgaa 9420 
tcgggagcgg cgataccgta aagcacgagg aagcggtcag cccattcgcc gccaagctct 9480 
tcagcaatat cacgggtagc caacgctatg tcctgatagc ggtccgccac acccagccgg 9540 
ccacagtcga tgaatccaga aaagcggcca ttttccacca tgatattcgg caagcaggca 9600 
tcgccatggg tcacgacgag atcctcgccg tcgggcatgc gcgccttgag cctggcgaac 9660
agttcggctg gcgcgagccc ctgatgctct tcgtccagat catcctgatc gacaagaccg 9720
gcttccatcc gagtacgtgc tcgctcgatg cgatgtttcg cttggtggtc gaatgggcag 9780
gtagccggat caagcgtatg cagccgccgc attgcatcag ccatgatgga tactttctcg 9840
gcaggagcaa ggtgagatga caggagatcc tgccccggca cttcgcccaa tagcagccag 9900 
tcccttcccg cttcagtgac aacgtcgagc acagctgcgc aaggaacgcc cgtcgtggcc 9960 
agccacgata gccgcgctgc ctcgtcctgc agttcattca gggcaccgga caggtcggtc 10020 
ttgacaaaaa gaaccgggcg cccctgcgct gacagccgga acacggcggc atcagagcag 10080 
ccgattgtct gttgtgccca gtcatagccg aatagcctct ccacccaagc ggccggagaa 10140
cctgcgtgca atccatcttg ttcaatccaa gctcccatgg gccctcgact agagtcgaga 10200
tctggattga gagtgaatat gagactctaa ttggataccg aggggaattt atggaacgtc 10260
agtggagcat ttttgacaag aaatatttgc tagctgatag tgaccttagg cgacttttga 10320
acgcgcaata atggtttctg acgtatgtgc ttagctcatt aaactccaga aacccgcggc 10380
tgagtggctc cttcaacgtt gcggttctgt cagttccaaa cgtaaaacgg cttgtcccgc 10440
gtcatcggcg ggggtcataa cgtgactccc ttaattctcc gctcatgatc ttgatcccct 10500
gcgccatcag atccttggcg gcaagaaagc catccagttt actttgcagg gcttcccaac 10560
cttaccagag ggcgccccag ctggcaattc cggttcgctt gctgtccata aaaccgccca 10620
gtctagctat cgccatgtaa gcccactgca agctacctgc tttctctttg cgcttgcgtt 10680
ttcccttgtc cagatagccc agtagctgac attcatccgg ggtcagcacc gtttctgcgg 10740
actggctttc tacgngttcc gcttccttta gcagcccttg cgccctgagt gcttgcggca 10800
gcgtgaagct tgcatgcctg caggtcgacg gcgcgccgag ctcctcgagc aaatttacac 10860
attgccacta aacgtctaaa cccttgtaat ttgtttttgt tttactatgt gtgttatgta 10920
tttgatttgc gataaatttt tatatttggt actaaattta taacaccttt tatgctaacg 10980 
tttgccaaca cttagcaatt tgcaagttga ttaattgatt ctaaattatt tttgtcttct 11040 
aaatacatat actaatcaac tggaaatgta aatatttgct aatatttcta ctataggaga 11100 
attaaagtga gtgaatatgg taccacaagg tttggagatt taattgttgc aatgctgcat 11160 
ggatggcata tacaccaaac attcaataat tcttgaggat aataatggta ccacacaaga 11220 
tttgaggtgc atgaacgtca cgtggacaaa aggtttagta atttttcaag acaacaatgt 11280 
taccacacac aagttttgag gtgcatgcat ggatgccctg tggaaagttt aaaaatattt 11340 
tggaaatgat ttgcatggaa gccatgtgta aaaccatgac atccacttgg aggatgcaat 11400 
aatgaagaaa actacaaatt tacatgcaac tagttatgca tgtagtctat ataatgagga 11460 
ttttgcaata ctttcattca tacacactca ctaagtttta cacgattata atttcttcat 11520 
agccagccca ccgcggtggg cggccgcctg cagtctagaa ggcctcctgc tttaatgaga 11580 
tatgcgagac gcctatgatc gcatgatatt tgctttcaat tctgttgtgc acgttgtaaa 11640 
aaacctgagc atgtgtagct cagatcctta ccgccggttt cggttcattc taatgaatat 11700 
atcacccgtt actatcgtat ttttatgaat aatattctcc gttcaattta ctgattgtcc 11760 
gtcgacgaat tcgagctcgg cgcgcctcta gaggatcgat gaattcagat cggctgagtg 11820 
gctccttcaa cgttgcggtt ctgtcagttc caaacgtaaa acggcttgtc ccgcgtcatc 11880 
ggcgggggtc ataacgtgac tcccttaatt ctccgctcat gatcagattg tcgtttcccg 11940 
ccttcagttt aaactatcag tgtttgacag gatatattgg cgggtaaacc taagagaaaa 12000 
gagcgtttat tagaataatc ggatatttaa aagggcgtga aaaggtttat ccttcgtcca 12060
tttgtatgtg catgccaacc acagggttcc cca 12093

<210> 39
<211> 12085
<212> DNA
<213> Unknown

<220>

<223> Plant expression vector with a promoter-terminator expression cassette

<400> 39

gatctggcgc cggccagcga gacgagcaag attggccgcc gcccgaaacg atccgacagc 60 

gcgcccagca caggtgcgca ggcaaattgc accaacgcat acagcgccag cagaatgcca 120 

tagtgggcgg tgacgtcgtt cgagtgaacc agatcgcgca ggaggcccgg cagcaccggc 180 

ataatcaggc cgatgccgac agcgtcgagc gcgacagtgc tcagaattac gatcaggggt 240

atgttgggtt tcacgtctgg cctccggacc agcctccgct ggtccgattg aacgcgcgga 300 

ttctttatca ctgataagtt ggtggacata ttatgtttat cagtgataaa gtgtcaagca 360 

tgacaaagtt gcagccgaat acagtgatcc gtgccgccct ggacctgttg aacgaggtcg 420 

gcgtagacgg tctgacgaca cgcaaactgg cggaacggtt gggggttcag cagccggcgc 480 

tttactggca cttcaggaac aagcgggcgc tgctcgacgc actggccgaa gccatgctgg 540 

cggagaatca tacgcattcg gtgccgagag ccgacgacga ctggcgctca tttctgatcg 600 

ggaatgcccg cagcttcagg caggcgctgc tcgcctaccg cgatggcgcg cgcatccatg 660 

ccggcacgcg accgggcgca ccgcagatgg aaacggccga cgcgcagctt cgcttcctct 720 

gcgaggcggg tttttcggcc ggggacgccg tcaatgcgct gatgacaatc agctacttca 780 

ctgttggggc cgtgcttgag gagcaggccg gcgacagcga tgccggcgag cgcggcggca 840 




ccgttgaaca ggctccgctc tcgccgctgt tgcgggccgc gatagacgcc ttcgacgaag 900
ccggtccgga cgcagcgttc gagcagggac tcgcggtgat tgtcgatgga ttggcgaaaa 960
ggaggctcgt tgtcaggaac gttgaaggac cgagaaaggg tgacgattga tcaggaccgc 1020
tgccggagcg caacccactc actacagcag agccatgtag acaacatccc ctcccccttt 1080
ccaccgcgtc agacgcccgt agcagcccgc tacgggcttt ttcatgccct gccctagcgt 1140
ccaagcctca cggccgcgct cggcctctct ggcggccttc tggcgctctt ccgcttcctc 1200
gctcactgac tcgctgcgct cggtcgttcg gctgcggcga gcggtatcag ctcactcaaa 1260
ggcggtaata cggttatcca cagaatcagg ggataacgca ggaaagaaca tgtgagcaaa 1320
aggccagcaa aaggccagga accgtaaaaa ggccgcgttg ctggcgtttt tccataggct 1380
ccgcccccct gacgagcatc acaaaaatcg acgctcaagt cagaggtggc gaaacccgac 1440
aggactataa agataccagg cgtttccccc tggaagctcc ctcgtgcgct ctcctgttcc 1500
gaccctgccg cttaccggat acctgtccgc ctttctccct tcgggaagcg tggcgctttt 1560
ccgctgcata accctgcttc ggggtcatta tagcgatttt ttcggtatat ccatcctttt 1620
tcgcacgata tacaggattt tgccaaaggg ttcgtgtaga ctttccttgg tgtatccaac 1680
ggcgtcagcc gggcaggata ggtgaagtag gcccacccgc gagcgggtgt tccttcttca 1740
ctgtccctta ttcgcacctg gcggtgctca acgggaatcc tgctctgcga ggctggccgg 1800
ctaccgccgg cgtaacagat gagggcaagc ggatggctga tgaaaccaag ccaaccagga 1860
agggcagccc acctatcaag gtgtactgcc ttccagacga acgaagagcg attgaggaaa 1920
aggcggcggc ggccggcatg agcctgtcgg cctacctgct ggccgtcggc cagggctaca 1980
aaatcacggg cgtcgtggac tatgagcacg tccgcgagct ggcccgcatc aatggcgacc 2040

tgggccgcct gggcggcctg ctgaaactct ggctcaccga cgacccgcgc acggcgcggt 2100 
tcggtgatgc cacgatcctc gccctgctgg cgaagatcga agagaagcag gacgagcttg 2160 
gcaaggtcat gatgggcgtg gtccgcccga gggcagagcc atgacttttt tagccgctaa 2220 
aacggccggg gggtgcgcgt gattgccaag cacgtcccca tgcgctccat caagaagagc 2280 
gacttcgcgg agctggtgaa gtacatcacc gacgagcaag gcaagaccga gcgcctttgc 2340 

gacgctcacc gggctggttg ccctcgccgc tgggctggcg gccgtctatg gccctgcaaa 2400
cgcgccagaa acgccgtcga agccgtgtgc gagacaccgc ggccgccggc gttgtggata 2460
cctcgcggaa aacttggccc tcactgacag atgaggggcg gacgttgaca cttgaggggc 2520
cgactcaccc ggcgcggcgt tgacagatga ggggcaggct cgatttcggc cggcgacgtg 2580
gagctggcca gcctcgcaaa tcggcgaaaa cgcctgattt tacgcgagtt tcccacagat 2640
gatgtggaca agcctgggga taagtgccct gcggtattga cacttgaggg gcgcgactac 2700
tgacagatga ggggcgcgat ccttgacact tgaggggcag agtgctgaca gatgaggggc 2760 
gcacctattg acatttgagg ggctgtccac aggcagaaaa tccagcattt gcaagggttt 2820 
ccgcccgttt ttcggccacc gctaacctgt cttttaacct gcttttaaac caatatttat 2880 
aaaccttgtt tttaaccagg gctgcgccct gtgcgcgtga ccgcgcacgc cgaagggggg 2940 
tgccccccct tctcgaaccc tcccggcccg ctaacgcggg cctcccatcc ccccaggggc 3000 
tgcgcccctc ggccgcgaac ggcctcaccc caaaaatggc agcgctggca gtccttgcca 3060 
ttgccgggat cggggcagta acgggatggg cgatcagccc gagcgcgacg cccggaagca 3120 
ttgacgtgcc gcaggtgctg gcatcgacat tcagcgacca ggtgccgggc agtgagggcg 3180 
gcggcctggg tggcggcctg cccttcactt cggccgtcgg ggcattcacg gacttcatgg 3240
cggggccggc aatttttacc ttgggcattc ttggcatagt ggtcgcgggt gccgtgctcg 3300
tgttcggggg tgcgataaac ccagcgaacc atttgaggtg ataggtaaga ttataccgag 3360 
gtatgaaaac gagaattgga cctttacaga attactctat gaagcgccat atttaaaaag 3420 
ctaccaagac gaagaggatg aagaggatga ggaggcagat tgccttgaat atattgacaa 3480 
tactgataag ataatatatc ttttatatag aagatatcgc cgtatgtaag gatttcaggg 3540 
ggcaaggcat aggcagcgcg cttatcaata tatctataga atgggcaaag cataaaaact 3600 
tgcatggact aatgcttgaa acccaggaca ataaccttat agcttgtaaa ttctatcata 3660 
attgggtaat gactccaact tattgatagt gttttatgtt cagataatgc ccgatgactt 3720 
tgtcatgcag ctccaccgat tttgagaacg acagcgactt ccgtcccagc cgtgccaggt 3780 
gctgcctcag attcaggtta tgccgctcaa ttcgctgcgt atatcgcttg ctgattacgt 3840
gcagctttcc cttcaggcgg gattcataca gcggccagcc atccgtcatc catatcacca 3900 
cgtcaaaggg tgacagcagg ctcataagac gccccagcgt cgccatagtg cgttcaccga 3960 
atacgtgcgc aacaaccgtc ttccggagac tgtcatacgc gtaaaacagc cagcgctggc 4020 
gcgatttagc cccgacatag ccccactgtt cgtccatttc cgcgcagacg atgacgtcac 4080 
tgcccggctg tatgcgcgag gttaccgact gcggcctgag ttttttaagt gacgtaaaat 4140 
cgtgttgagg ccaacgccca taatgcgggc tgttgcccgg catccaacgc cattcatggc 4200 
catatcaatg attttctggt gcgtaccggg ttgagaagcg gtgtaagtga actgcagttg 4260
ccatgtttta cggcagtgag agcagagata gcgctgatgt ccggcggtgc ttttgccgtt 4320
acgcaccacc ccgtcagtag ctgaacagga gggacagctg atagacacag aagccactgg 4380
agcacctcaa aaacaccatc atacactaaa tcagtaagtt ggcagcatca cccataattg 4440
tggtttcaaa atcggctccg tcgatactat gttatacgcc aactttgaaa acaactttga 4500

aaaagctgtt ttctggtatt taaggtttta gaatgcaagg aacagtgaat tggagttcgt 4560

cttgttataa ttagcttctt ggggtatctt taaatactgt agaaaagagg aaggaaataa 4620 

taaatggcta aaatgagaat atcaccggaa ttgaaaaaac tgatcgaaaa ataccgctgc 4680 



gtaaaagata cggaaggaat gtctcctgct aaggtatata agctggtggg agaaaatgaa 4740 

aacctatatt taaaaatgac ggacagccgg tataaaggga ccacctatga tgtggaacgg 4800 

gaaaaggaca tgatgctatg gctggaagga aagctgcctg ttccaaaggt cctgcacttt 4860 

gaacggcatg atggctggag caatctgctc atgagtgagg ccgatggcgt cctttgctcg 4920 

gaagagtatg aagatgaaca aagccctgaa aagattatcg agctgtatgc ggagtgcatc 4980

aggctctttc actccatcga catatcggat tgtccctata cgaatagctt agacagccgc 5040 

ttagccgaat tggattactt actgaataac gatctggccg atgtggattg cgaaaactgg 5100 

gaagaagaca ctccatttaa agatccgcgc gagctgtatg attttttaaa gacggaaaag 5160

cccgaagagg aacttgtctt ttcccacggc gacctgggag acagcaacat ctttgtgaaa 5220 

gatggcaaag taagtggctt tattgatctt gggagaagcg gcagggcgga caagtggtat 5280

gacattgcct tctgcgtccg gtcgatcagg gaggatatcg gggaagaaca gtatgtcgag 5340

ctattttttg acttactggg gatcaagcct gattgggaga aaataaaata ttatatttta 5400 

ctggatgaat tgttttagta cctagatgtg gcgcaacgat gccggcgaca agcaggagcg 5460 

caccgacttc ttccgcatca agtgttttgg ctctcaggcc gaggcccacg gcaagtattt 5520 

gggcaagggg tcgctggtat tcgtgcaggg caagattcgg aataccaagt acgagaagga 5580 

cggccagacg gtctacggga ccgacttcat tgccgataag gtggattatc tggacaccaa 5640
ggcaccaggc gggtcaaatc aggaataagg gcacattgcc ccggcgtgag tcggggcaat 5700
cccgcaagga gggtgaatga atcggacgtt tgaccggaag gcatacaggc aagaactgat 5760 
cgacgcgggg ttttccgccg aggatgccga aaccatcgca agccgcaccg tcatgcgtgc 5820 
gccccgcgaa accttccagt ccgtcggctc gatggtccag caagctacgg ccaagatcga 5880 
gcgcgacagc gtgcaactgg ctccccctgc cctgcccgcg ccatcggccg ccgtggagcg 5940 
ttcgcgtcgt ctcgaacagg aggcggcagg tttggcgaag tcgatgacca tcgacacgcg 6000 
aggaactatg acgaccaaga agcgaaaaac cgccggcgag gacctggcaa aacaggtcag 6060 
cgaggccaag caggccgcgt tgctgaaaca cacgaagcag cagatcaagg aaatgcagct 6120 
ttccttgttc gatattgcgc cgtggccgga cacgatgcga gcgatgccaa acgacacggc 6180 
ccgctctgcc ctgttcacca cgcgcaacaa gaaaatcccg cgcgaggcgc tgcaaaacaa 6240 
ggtcattttc cacgtcaaca aggacgtgaa gatcacctac accggcgtcg agctgcgggc 6300 
cgacgatgac gaactggtgt ggcagcaggt gttggagtac gcgaagcgca cccctatcgg 6360 
cgagccgatc accttcacgt tctacgagct ttgccaggac ctgggctggt cgatcaatgg 6420 
ccggtattac acgaaggccg aggaatgcct gtcgcgccta caggcgacgg cgatgggctt 6480 
cacgtccgac cgcgttgggc acctggaatc ggtgtcgctg ctgcaccgct tccgcgtcct 6540 
ggaccgtggc aagaaaacgt cccgttgcca ggtcctgatc gacgaggaaa tcgtcgtgct 6600 
gtttgctggc gaccactaca cgaaattcat atgggagaag taccgcaagc tgtcgccgac 6660 
ggcccgacgg atgttcgact atttcagctc gcaccgggag ccgtacccgc tcaagctgga 6720 
aaccttccgc ctcatgtgcg gatcggattc cacccgcgtg aagaagtggc gcgagcaggt 6780 
cggcgaagcc tgcgaagagt tgcgaggcag cggcctggtg gaacacgcct gggtcaatga 6840
tgacctggtg cattgcaaac gctagggcct tgtggggtca gttccggctg ggggttcagc 6900
agccagcgct ttactggcat ttcaggaaca agcgggcact gctcgacgca cttgcttcgc 6960 
tcagtatcgc tcgggacgca cggcgcgctc tacgaactgc cgataaacag aggattaaaa 7020 
ttgacaattg tgattaaggc tcagattcga cggcttggag cggccgacgt gcaggatttc 7080 
cgcgagatcc gattgtcggc cctgaagaaa gctccagaga tgttcgggtc cgtttacgag 7140 

cacgaggaga aaaagcccat ggaggcgttc gctgaacggt tgcgagatgc cgtggcattc 7200

ggcgcctaca tcgacggcga gatcattggg ctgtcggtct tcaaacagga ggacggcccc 7260

aaggacgctc acaaggcgca tctgtccggc gttttcgtgg agcccgaaca gcgaggccga 7320 

ggggtcgccg gtatgctgct gcgggcgttg ccggcgggtt tattgctcgt gatgatcgtc 7380

cgacagattc caacgggaat ctggtggatg cgcatcttca tcctcggcgc acttaatatt 7440 

tcgctattct ggagcttgtt gtttatttcg gtctaccgcc tgccgggcgg ggtcgcggcg 7500 

acggtaggcg ctgtgcagcc gctgatggtc gtgttcatct ctgccgctct gctaggtagc 7560

ccgatacgat tgatggcggt cctgggggct atttgcggaa ctgcgggcgt ggcgctgttg 7620

gtgttgacac caaacgcagc gctagatcct gtcggcgtcg cagcgggcct ggcgggggcg 7680 

gtttccatgg cgttcggaac cgtgctgacc cgcaagtggc aacctcccgt gcctctgctc 7740 

acctttaccg cctggcaact ggcggccgga ggacttctgc tcgttccagt agctttagtg 7800 

tttgatccgc caatcccgat gcctacagga accaatgttc tcggcctggc gtggctcggc 7860 

ctgatcggag cgggtttaac ctacttcctt tggttccggg ggatctcgcg actcgaacct 7920 

acagttgttt ccttactggg ctttctcagc cccagatctg gggtcgatca gccggggatg 7980 

catcaggccg acagtcggaa cttcgggtcc ccgacctgta ccattcggtg agcaatggat 8040
aggggagttg atatcgtcaa cgttcacttc taaagaaata gcgccactca gcttcctcag 8100
cggctttatc cagcgatttc ctattatgtc ggcatagttc tcaagatcga cagcctgtca 8160
cggttaagcg agaaatgaat aagaaggctg ataattcgga tctctgcgag ggagatgata 8220
tttgatcaca ggcagcaacg ctctgtcatc gttacaatca acatgctacc ctccgcgaga 8280
tcatccgtgt ttcaaacccg gcagcttagt tgccgttctt ccgaatagca tcggtaacat 8340 
gagcaaagtc tgccgcctta caacggctct cccgctgacg ccgtcccgga ctgatgggct 8400 
gcctgtatcg agtggtgatt ttgtgccgag ctgccggtcg gggagctgtt ggctggctgg 8460 
tggcaggata tattgtggtg taaacaaatt gacgcttaga caacttaata acacattgcg 8520 
gacgttttta atgtactggg gtggtttttc ttttcaccag tgagacgggc aacagctgat 8580 
tgcccttcac cgcctggccc tgagagagtt gcagcaagcg gtccacgctg gtttgcccca 8640
gcaggcgaaa atcctgtttg atggtggttc cgaaatcggc aaaatccctt ataaatcaaa 8700
agaatagccc gagatagggt tgagtgttgt tccagtttgg aacaagagtc cactattaaa 8760
gaacgtggac tccaacgtca aagggcgaaa aaccgtctat cagggcgatg gcccactacg 8820
tgaaccatca cccaaatcaa gttttttggg gtcgaggtgc cgtaaagcac taaatcggaa 8880
ccctaaaggg agcccccgat ttagagcttg acggggaaag ccggcgaacg tggcgagaaa 8940
ggaagggaag aaagcgaaag gagcgggcgc cattcaggct gcgcaactgt tgggaagggc 9000 
gatcggtgcg ggcctcttcg ctattacgcc agctggcgaa agggggatgt gctgcaaggc 9060 
gattaagttg ggtaacgcca gggttttccc agtcacgacg ttgtaaaacg acggccagtg 9120 
aattaattcc catcttgaaa gaaatatagt ttaaatattt attgataaaa taacaagtca 9180 
ggtattatag tccaagcaaa aacataaatt tattgatgca agtttaaatt cagaaatatt 9240
tcaataactg attatatcag ctggtacatt gccgtagatg aaagactgag tgcgatatta 9300
tgtgtaatac ataaattgat gatatagcta gcttagctca tcgggggatc cgtcgaagct 9360 
agcttgggtc ccgctcagaa gaactcgtca agaaggcgat agaaggcgat gcgctgcgaa 9420 
tcgggagcgg cgataccgta aagcacgagg aagcggtcag cccattcgcc gccaagctct 9480 
tcagcaatat cacgggtagc caacgctatg tcctgatagc ggtccgccac acccagccgg 9540 
ccacagtcga tgaatccaga aaagcggcca ttttccacca tgatattcgg caagcaggca 9600 
tcgccatggg tcacgacgag atcctcgccg tcgggcatgc gcgccttgag cctggcgaac 9660 
agttcggctg gcgcgagccc ctgatgctct tcgtccagat catcctgatc gacaagaccg 9720
gcttccatcc gagtacgtgc tcgctcgatg cgatgtttcg cttggtggtc gaatgggcag 9780 
gtagccggat caagcgtatg cagccgccgc attgcatcag ccatgatgga tactttctcg 9840 
gcaggagcaa ggtgagatga caggagatcc tgccccggca cttcgcccaa tagcagccag 9900 
tcccttcccg cttcagtgac aacgtcgagc acagctgcgc aaggaacgcc cgtcgtggcc 9960
agccacgata gccgcgctgc ctcgtcctgc agttcattca gggcaccgga caggtcggtc 10020 
ttgacaaaaa gaaccgggcg cccctgcgct gacagccgga acacggcggc atcagagcag 10080 
ccgattgtct gttgtgccca gtcatagccg aatagcctct ccacccaagc ggccggagaa 10140 
cctgcgtgca atccatcttg ttcaatccaa gctcccatgg gccctcgact agagtcgaga 10200 
tctggattga gagtgaatat gagactctaa ttggataccg aggggaattt atggaacgtc 10260 
agtggagcat ttttgacaag aaatatttgc tagctgatag tgaccttagg cgacttttga 10320 
acgcgcaata atggtttctg acgtatgtgc ttagctcatt aaactccaga aacccgcggc 10380 
tgagtggctc cttcaacgtt gcggttctgt cagttccaaa cgtaaaacgg cttgtcccgc 10440
gtcatcggcg ggggtcataa cgtgactccc ttaattctcc gctcatgatc ttgatcccct 10500
gcgccatcag atccttggcg gcaagaaagc catccagttt actttgcagg gcttcccaac 10560 
cttaccagag ggcgccccag ctggcaattc cggttcgctt gctgtccata aaaccgccca 10620 
gtctagctat cgccatgtaa gcccactgca agctacctgc tttctctttg cgcttgcgtt 10680 
ttcccttgtc cagatagccc agtagctgac attcatccgg ggtcagcacc gtttctgcgg 10740 
actggctttc tacgtgttcc gcttccttta gcagcccttg cgccctgagt gcttgcggca 10800 
gcgtgaagct tgcatgcctg caggtcgacg gcgcgccgag ctcctcgagc aaatttacac 10860 
attgccacta aacgtctaaa cccttgtaat ttgtttttgt tttactatgt gtgttatgta 10920 
tttgatttgc gataaatttt tatatttggt actaaattta taacaccttt tatgctaacg 10980 
tttgccaaca cttagcaatt tgcaagttga ttaattgatt ctaaattatt tttgtcttct 11040
aaatacatat actaatcaac tggaaatgta aatatttgct aatatttcta ctataggaga 11100
attaaagtga gtgaatatgg taccacaagg tttggagatt taattgttgc aatgctgcat 11160
ggatggcata tacaccaaac attcaataat tcttgaggat aataatggta ccacacaaga 11220 
tttgaggtgc atgaacgtca cgtggacaaa aggtttagta atttttcaag acaacaatgt 11280 
taccacacac aagttttgag gtgcatgcat ggatgccctg tggaaagttt aaaaatattt 11340 
tggaaatgat ttgcatggaa gccatgtgta aaaccatgac atccacttgg aggatgcaat 11400 
aatgaagaaa actacaaatt tacatgcaac tagttatgca tgtagtctat ataatgagga 11460
ttttgcaata ctttcattca tacacactca ctaagtttta cacgattata atttcttcat 11520
agccagcgga tccgatatcg ggcccgctag cgttaaccct gctttaatga gatatgcgag 11580
acgcctatga tcgcatgata tttgctttca attctgttgt gcacgttgta aaaaacctga 11640
gcatgtgtag ctcagatcct taccgccggt ttcggttcat tctaatgaat atatcacccg 11700
ttactatcgt atttttatga ataatattct ccgttcaatt tactgattgt ccgtcgacga 11760 
attcgagctc ggcgcgcctc tagaggatcg atgaattcag atcggctgag tggctccttc 11820 



aacgttgcgg ttctgtcagt tccaaacgta aaacggcttg tcccgcgtca tcggcggggg 11880 
tcataacgtg actcccttaa ttctccgctc atgatcagat tgtcgtttcc cgccttcagt 11940
ttaaactatc agtgtttgac aggatatatt ggcgggtaaa cctaagagaa aagagcgttt 12000 
attagaataa tcggatattt aaaagggcgt gaaaaggttt atccttcgtc catttgtatg 12060 
tgcatgccaa ccacagggtt cccca 12085

<210> 40
<211> 12079
<212> DNA
<213> Unknown

<220>

<223> Plant expression vector with a promoter-terminator expression cassette

<400> 40 

gatctggcgc cggccagcga gacgagcaag attggccgcc gcccgaaacg atccgacagc 60 
gcgcccagca caggtgcgca ggcaaattgc accaacgcat acagcgccag cagaatgcca 120 
tagtgggcgg tgacgtcgtt cgagtgaacc agatcgcgca ggaggcccgg cagcaccggc 180 
ataatcaggc cgatgccgac agcgtcgagc gcgacagtgc tcagaattac gatcaggggt 240 
atgttgggtt tcacgtctgg cctccggacc agcctccgct ggtccgattg aacgcgcgga 300 
ttctttatca ctgataagtt ggtggacata ttatgtttat cagtgataaa gtgtcaagca 360
tgacaaagtt gcagccgaat acagtgatcc gtgccgccct ggacctgttg aacgaggtcg 420
gcgtagacgg tctgacgaca cgcaaactgg cggaacggtt gggggttcag cagccggcgc 480
tttactggca cttcaggaac aagcgggcgc tgctcgacgc actggccgaa gccatgctgg 540
cggagaatca tacgcattcg gtgccgagag ccgacgacga ctggcgctca tttctgatcg 600
ggaatgcccg cagcttcagg caggcgctgc tcgcctaccg cgatggcgcg cgcatccatg 660
ccggcacgcg accgggcgca ccgcagatgg aaacggccga cgcgcagctt cgcttcctct 720
gcgaggcggg tttttcggcc ggggacgccg tcaatgcgct gatgacaatc agctacttca 780
ctgttggggc cgtgcttgag gagcaggccg gcgacagcga tgccggcgag cgcggcggca 840
ccgttgaaca ggctccgctc tcgccgctgt tgcgggccgc gatagacgcc ttcgacgaag 900
ccggtccgga cgcagcgttc gagcagggac tcgcggtgat tgtcgatgga ttggcgaaaa 960
ggaggctcgt tgtcaggaac gttgaaggac cgagaaaggg tgacgattga tcaggaccgc 1020
tgccggagcg caacccactc actacagcag agccatgtag acaacatccc ctcccccttt 1080
ccaccgcgtc agacgcccgt agcagcccgc tacgggcttt ttcatgccct gccctagcgt 1140
ccaagcctca cggccgcgct cggcctctct ggcggccttc tggcgctctt ccgcttcctc 1200
gctcactgac tcgctgcgct cggtcgttcg gctgcggcga gcggtatcag ctcactcaaa 1260
ggcggtaata cggttatcca cagaatcagg ggataacgca ggaaagaaca tgtgagcaaa 1320
aggccagcaa aaggccagga accgtaaaaa ggccgcgttg ctggcgtttt tccataggct 1380
ccgcccccct gacgagcatc acaaaaatcg acgctcaagt cagaggtggc gaaacccgac 1440
aggactataa agataccagg cgtttccccc tggaagctcc ctcgtgcgct ctcctgttcc 1500
gaccctgccg cttaccggat acctgtccgc ctttctccct tcgggaagcg tggcgctttt 1560
ccgctgcata accctgcttc ggggtcatta tagcgatttt ttcggtatat ccatcctttt 1620

tcgcacgata tacaggattt tgccaaaggg ttcgtgtaga ctttccttgg tgtatccaac 1680 

ggcgtcagcc gggcaggata ggtgaagtag gcccacccgc gagcgggtgt tccttcttca 1740 

ctgtccctta ttcgcacctg gcggtgctca acgggaatcc tgctctgcga ggctggccgg 1800 

ctaccgccgg cgtaacagat gagggcaagc ggatggctga tgaaaccaag ccaaccagga 1860 

agggcagccc acctatcaag gtgtactgcc ttccagacga acgaagagcg attgaggaaa 1920 

aggcggcggc ggccggcatg agcctgtcgg cctacctgct ggccgtcggc cagggctaca 1980 

aaatcacggg cgtcgtggac tatgagcacg tccgcgagct ggcccgcatc aatggcgacc 2040 

tgggccgcct gggcggcctg ctgaaactct ggctcaccga cgacccgcgc acggcgcggt 2100 

tcggtgatgc cacgatcctc gccctgctgg cgaagatcga agagaagcag gacgagcttg 2160 

gcaaggtcat gatgggcgtg gtccgcccga gggcagagcc atgacttttt tagccgctaa 2220 

aacggccggg gggtgcgcgt gattgccaag cacgtcccca tgcgctccat caagaagagc 2280

gacttcgcgg agctggtgaa gtacatcacc gacgagcaag gcaagaccga gcgcctttgc 2340 

gacgctcacc gggctggttg ccctcgccgc tgggctggcg gccgtctatg gccctgcaaa 2400 

cgcgccagaa acgccgtcga agccgtgtgc gagacaccgc ggccgccggc gttgtggata 2460

cctcgcggaa aacttggccc tcactgacag atgaggggcg gacgttgaca cttgaggggc 2520

cgactcaccc ggcgcggcgt tgacagatga ggggcaggct cgatttcggc cggcgacgtg 2580

gagctggcca gcctcgcaaa tcggcgaaaa cgcctgattt tacgcgagtt tcccacagat 2640

gatgtggaca agcctgggga taagtgccct gcggtattga cacttgaggg gcgcgactac 2700

tgacagatga ggggcgcgat ccttgacact tgaggggcag agtgctgaca gatgaggggc 2760
gcacctattg acatttgagg ggctgtccac aggcagaaaa tccagcattt gcaagggttt 2820
ccgcccgttt ttcggccacc gctaacctgt cttttaacct gcttttaaac caatatttat 2880 
aaaccttgtt tttaaccagg gctgcgccct gtgcgcgtga ccgcgcacgc cgaagggggg 2940 
tgccccccct tctcgaaccc tcccggcccg ctaacgcggg cctcccatcc ccccaggggc 3000 
tgcgcccctc ggccgcgaac ggcctcaccc caaaaatggc agcgctggca gtccttgcca 3060 
ttgccgggat cggggcagta acgggatggg cgatcagccc gagcgcgacg cccggaagca 3120 
ttgacgtgcc gcaggtgctg gcatcgacat tcagcgacca ggtgccgggc agtgagggcg 3180 
gcggcctggg tggcggcctg cccttcactt cggccgtcgg ggcattcacg gacttcatgg 3240
cggggccggc aatttttacc ttgggcattc ttggcatagt ggtcgcgggt gccgtgctcg 3300
tgttcggggg tgcgataaac ccagcgaacc atttgaggtg ataggtaaga ttataccgag 3360
gtatgaaaac gagaattgga cctttacaga attactctat gaagcgccat atttaaaaag 3420
ctaccaagac gaagaggatg aagaggatga ggaggcagat tgccttgaat atattgacaa 3480
tactgataag ataatatatc ttttatatag aagatatcgc cgtatgtaag gatttcaggg 3540
ggcaaggcat aggcagcgcg cttatcaata tatctataga atgggcaaag cataaaaact 3600
tgcatggact aatgcttgaa acccaggaca ataaccttat agcttgtaaa ttctatcata 3660 
attgggtaat gactccaact tattgatagt gttttatgtt cagataatgc ccgatgactt 3720 
tgtcatgcag ctccaccgat tttgagaacg acagcgactt ccgtcccagc cgtgccaggt 3780 
gctgcctcag attcaggtta tgccgctcaa ttcgctgcgt atatcgcttg ctgattacgt 3840 
gcagctttcc cttcaggcgg gattcataca gcggccagcc atccgtcatc catatcacca 3900
cgtcaaaggg tgacagcagg ctcataagac gccccagcgt cgccatagtg cgttcaccga 3960
atacgtgcgc aacaaccgtc ttccggagac tgtcatacgc gtaaaacagc cagcgctggc 4020
gcgatttagc cccgacatag ccccactgtt cgtccatttc cgcgcagacg atgacgtcac 4080 
tgcccggctg tatgcgcgag gttaccgact gcggcctgag ttttttaagt gacgtaaaat 4140
cgtgttgagg ccaacgccca taatgcgggc tgttgcccgg catccaacgc cattcatggc 4200 
catatcaatg attttctggt gcgtaccggg ttgagaagcg gtgtaagtga actgcagttg 4260 
ccatgtttta cggcagtgag agcagagata gcgctgatgt ccggcggtgc ttttgccgtt 4320
acgcaccacc ccgtcagtag ctgaacagga gggacagctg atagacacag aagccactgg 4380 
agcacctcaa aaacaccatc atacactaaa tcagtaagtt ggcagcatca cccataattg 4440 

tggtttcaaa atcggctccg tcgatactat gttatacgcc aactttgaaa acaactttga 4500 
aaaagctgtt ttctggtatt taaggtttta gaatgcaagg aacagtgaat tggagttcgt 4560 
cttgttataa ttagcttctt ggggtatctt taaatactgt agaaaagagg aaggaaataa 4620 
taaatggcta aaatgagaat atcaccggaa ttgaaaaaac tgatcgaaaa ataccgctgc 4680 
gtaaaagata cggaaggaat gtctcctgct aaggtatata agctggtggg agaaaatgaa 4740 
aacctatatt taaaaatgac ggacagccgg tataaaggga ccacctatga tgtggaacgg 4800 
gaaaaggaca tgatgctatg gctggaagga aagctgcctg ttccaaaggt cctgcacttt 4860
gaacggcatg atggctggag caatctgctc atgagtgagg ccgatggcgt cctttgctcg 4920
gaagagtatg aagatgaaca aagccctgaa aagattatcg agctgtatgc ggagtgcatc 4980
aggctctttc actccatcga catatcggat tgtccctata cgaatagctt agacagccgc 5040 
ttagccgaat tggattactt actgaataac gatctggccg atgtggattg cgaaaactgg 5100 
gaagaagaca ctccatttaa agatccgcgc gagctgtatg attttttaaa gacggaaaag 5160
cccgaagagg aacttgtctt ttcccacggc gacctgggag acagcaacat ctttgtgaaa 5220

gatggcaaag taagtggctt tattgatctt gggagaagcg gcagggcgga caagtggtat 5280 

gacattgcct tctgcgtccg gtcgatcagg gaggatatcg gggaagaaca gtatgtcgag 5340 

ctattttttg acttactggg gatcaagcct gattgggaga aaataaaata ttatatttta 5400 

ctggatgaat tgttttagta cctagatgtg gcgcaacgat gccggcgaca agcaggagcg 5460 

caccgacttc ttccgcatca agtgttttgg ctctcaggcc gaggcccacg gcaagtattt 5520 
gggcaagggg tcgctggtat tcgtgcaggg caagattcgg aataccaagt acgagaagga 5580 

cggccagacg gtctacggga ccgacttcat tgccgataag gtggattatc tggacaccaa 5640

ggcaccaggc gggtcaaatc aggaataagg gcacattgcc ccggcgtgag tcggggcaat 5700 

cccgcaagga gggtgaatga atcggacgtt tgaccggaag gcatacaggc aagaactgat 5760 

cgacgcgggg ttttccgccg aggatgccga aaccatcgca agccgcaccg tcatgcgtgc 5820 

gccccgcgaa accttccagt ccgtcggctc gatggtccag caagctacgg ccaagatcga 5880 

gcgcgacagc gtgcaactgg ctccccctgc cctgcccgcg ccatcggccg ccgtggagcg 5940 

ttcgcgtcgt ctcgaacagg aggcggcagg tttggcgaag tcgatgacca tcgacacgcg 6000 

aggaactatg acgaccaaga agcgaaaaac cgccggcgag gacctggcaa aacaggtcag 6060 

cgaggccaag caggccgcgt tgctgaaaca cacgaagcag cagatcaagg aaatgcagct 6120 

ttccttgttc gatattgcgc cgtggccgga cacgatgcga gcgatgccaa acgacacggc 6180 

ccgctctgcc ctgttcacca cgcgcaacaa gaaaatcccg cgcgaggcgc tgcaaaacaa 6240 

ggtcattttc cacgtcaaca aggacgtgaa gatcacctac accggcgtcg agctgcgggc 6300

cgacgatgac gaactggtgt ggcagcaggt gttggagtac gcgaagcgca cccctatcgg 6360
cgagccgatc accttcacgt tctacgagct ttgccaggac ctgggctggt cgatcaatgg 6420
ccggtattac acgaaggccg aggaatgcct gtcgcgccta caggcgacgg cgatgggctt 6480 
cacgtccgac cgcgttgggc acctggaatc ggtgtcgctg ctgcaccgct tccgcgtcct 6540 
ggaccgtggc aagaaaacgt cccgttgcca ggtcctgatc gacgaggaaa tcgtcgtgct 6600 
gtttgctggc gaccactaca cgaaattcat atgggagaag taccgcaagc tgtcgccgac 6660 
ggcccgacgg atgttcgact atttcagctc gcaccgggag ccgtacccgc tcaagctgga 6720 
aaccttccgc ctcatgtgcg gatcggattc cacccgcgtg aagaagtggc gcgagcaggt 6780 
cggcgaagcc tgcgaagagt tgcgaggcag cggcctggtg gaacacgcct gggtcaatga 6840
tgacctggtg cattgcaaac gctagggcct tgtggggtca gttccggctg ggggttcagc 6900
agccagcgct ttactggcat ttcaggaaca agcgggcact gctcgacgca cttgcttcgc 6960
tcagtatcgc tcgggacgca cggcgcgctc tacgaactgc cgataaacag aggattaaaa 7020 
ttgacaattg tgattaaggc tcagattcga cggcttggag cggccgacgt gcaggatttc 7080 

cgcgagatcc gattgtcggc cctgaagaaa gctccagaga tgttcgggtc cgtttacgag 7140
cacgaggaga aaaagcccat ggaggcgttc gctgaacggt tgcgagatgc cgtggcattc 7200
ggcgcctaca tcgacggcga gatcattggg ctgtcggtct tcaaacagga ggacggcccc 7260
aaggacgctc acaaggcgca tctgtccggc gttttcgtgg agcccgaaca gcgaggccga 7320 
ggggtcgccg gtatgctgct gcgggcgttg ccggcgggtt tattgctcgt gatgatcgtc 7380 
cgacagattc caacgggaat ctggtggatg cgcatcttca tcctcggcgc acttaatatt 7440 
tcgctattct ggagcttgtt gtttatttcg gtctaccgcc tgccgggcgg ggtcgcggcg 7500 
acggtaggcg ctgtgcagcc gctgatggtc gtgttcatct ctgccgctct gctaggtagc 7560
ccgatacgat tgatggcggt cctgggggct atttgcggaa ctgcgggcgt ggcgctgttg 7620
gtgttgacac caaacgcagc gctagatcct gtcggcgtcg cagcgggcct ggcgggggcg 7680
gtttccatgg cgttcggaac cgtgctgacc cgcaagtggc aacctcccgt gcctctgctc 7740
acctttaccg cctggcaact ggcggccgga ggacttctgc tcgttccagt agctttagtg 7800 
tttgatccgc caatcccgat gcctacagga accaatgttc tcggcctggc gtggctcggc 7860 
ctgatcggag cgggtttaac ctacttcctt tggttccggg ggatctcgcg actcgaacct 7920 
acagttgttt ccttactggg ctttctcagc cccagatctg gggtcgatca gccggggatg 7980 
catcaggccg acagtcggaa cttcgggtcc ccgacctgta ccattcggtg agcaatggat 8040
aggggagttg atatcgtcaa cgttcacttc taaagaaata gcgccactca gcttcctcag 8100
cggctttatc cagcgatttc ctattatgtc ggcatagttc tcaagatcga cagcctgtca 8160
cggttaagcg agaaatgaat aagaaggctg ataattcgga tctctgcgag ggagatgata 8220
tttgatcaca ggcagcaacg ctctgtcatc gttacaatca acatgctacc ctccgcgaga 8280

tcatccgtgt ttcaaacccg gcagcttagt tgccgttctt ccgaatagca tcggtaacat 8340 
gagcaaagtc tgccgcctta caacggctct cccgctgacg ccgtcccgga ctgatgggct 8400 

gcctgtatcg agtggtgatt ttgtgccgag ctgccggtcg gggagctgtt ggctggctgg 8460 

tggcaggata tattgtggtg taaacaaatt gacgcttaga caacttaata acacattgcg 8520

gacgttttta atgtactggg gtggtttttc ttttcaccag tgagacgggc aacagctgat 8580 

tgcccttcac cgcctggccc tgagagagtt gcagcaagcg gtccacgctg gtttgcccca 8640 

gcaggcgaaa atcctgtttg atggtggttc cgaaatcggc aaaatccctt ataaatcaaa 8700 

agaatagccc gagatagggt tgagtgttgt tccagtttgg aacaagagtc cactattaaa 8760
gaacgtggac tccaacgtca aagggcgaaa aaccgtctat cagggcgatg gcccactacg 8820
tgaaccatca cccaaatcaa gttttttggg gtcgaggtgc cgtaaagcac taaatcggaa 8880 
ccctaaaggg agcccccgat ttagagcttg acggggaaag ccggcgaacg tggcgagaaa 8940 
ggaagggaag aaagcgaaag gagcgggcgc cattcaggct gcgcaactgt tgggaagggc 9000 
gatcggtgcg ggcctcttcg ctattacgcc agctggcgaa agggggatgt gctgcaaggc 9060 
gattaagttg ggtaacgcca gggttttccc agtcacgacg ttgtaaaacg acggccagtg 9120 
aattaattcc catcttgaaa gaaatatagt ttaaatattt attgataaaa taacaagtca 9180 
ggtattatag tccaagcaaa aacataaatt tattgatgca agtttaaatt cagaaatatt 9240 
tcaataactg attatatcag ctggtacatt gccgtagatg aaagactgag tgcgatatta 9300 
tgtgtaatac ataaattgat gatatagcta gcttagctca tcgggggatc cgtcgaagct 9360 
agcttgggtc ccgctcagaa gaactcgtca agaaggcgat agaaggcgat gcgctgcgaa 9420 
tcgggagcgg cgataccgta aagcacgagg aagcggtcag cccattcgcc gccaagctct 9480 
tcagcaatat cacgggtagc caacgctatg tcctgatagc ggtccgccac acccagccgg 9540 
ccacagtcga tgaatccaga aaagcggcca ttttccacca tgatattcgg caagcaggca 9600 
tcgccatggg tcacgacgag atcctcgccg tcgggcatgc gcgccttgag cctggcgaac 9660 
agttcggctg gcgcgagccc ctgatgctct tcgtccagat catcctgatc gacaagaccg 9720 
gcttccatcc gagtacgtgc tcgctcgatg cgatgtttcg cttggtggtc gaatgggcag 9780 
gtagccggat caagcgtatg cagccgccgc attgcatcag ccatgatgga tactttctcg 9840 
gcaggagcaa ggtgagatga caggagatcc tgccccggca cttcgcccaa tagcagccag 9900 

tcccttcccg cttcagtgac aacgtcgagc acagctgcgc aaggaacgcc cgtcgtggcc 9960
agccacgata gccgcgctgc ctcgtcctgc agttcattca gggcaccgga caggtcggtc 10020

ttgacaaaaa gaaccgggcg cccctgcgct gacagccgga acacggcggc atcagagcag 10080 

ccgattgtct gttgtgccca gtcatagccg aatagcctct ccacccaagc ggccggagaa 10140 

cctgcgtgca atccatcttg ttcaatccaa gctcccatgg gccctcgact agagtcgaga 10200 

tctggattga gagtgaatat gagactctaa ttggataccg aggggaattt atggaacgtc 10260

agtggagcat ttttgacaag aaatatttgc tagctgatag tgaccttagg cgacttttga 10320 

acgcgcaata atggtttctg acgtatgtgc ttagctcatt aaactccaga aacccgcggc 10380 

tgagtggctc cttcaacgtt gcggttctgt cagttccaaa cgtaaaacgg cttgtcccgc 10440 

gtcatcggcg ggggtcataa cgtgactccc ttaattctcc gctcatgatc ttgatcccct 10500 

gcgccatcag atccttggcg gcaagaaagc catccagttt actttgcagg gcttcccaac 10560 

cttaccagag ggcgccccag ctggcaattc cggttcgctt gctgtccata aaaccgccca 10620

gtctagctat cgccatgtaa gcccactgca agctacctgc tttctctttg cgcttgcgtt 10680

ttcccttgtc cagatagccc agtagctgac attcatccgg ggtcagcacc gtttctgcgg 10740

actggctttc tacgtgttcc gcttccttta gcagcccttg cgccctgagt gcttgcggca 10800

gcgtgaagct tgcatgcctg caggtcgacg gcgcgccgag ctcctcgagc aaatttacac 10860 

attgccacta aacgtctaaa cccttgtaat ttgtttttgt tttactatgt gtgttatgta 10920 

tttgatttgc gataaatttt tatatttggt actaaattta taacaccttt tatgctaacg 10980 

tttgccaaca cttagcaatt tgcaagttga ttaattgatt ctaaattatt tttgtcttct 11040 

aaatacatat actaatcaac tggaaatgta aatatttgct aatatttcta ctataggaga 11100 

attaaagtga gtgaatatgg taccacaagg tttggagatt taattgttgc aatgctgcat 11160
ggatggcata tacaccaaac attcaataat tcttgaggat aataatggta ccacacaaga 11220

tttgaggtgc atgaacgtca cgtggacaaa aggtttagta atttttcaag acaacaatgt 11280

taccacacac aagttttgag gtgcatgcat ggatgccctg tggaaagttt aaaaatattt 11340 

tggaaatgat ttgcatggaa gccatgtgta aaaccatgac atccacttgg aggatgcaat 11400 

aatgaagaaa actacaaatt tacatgcaac tagttatgca tgtagtctat ataatgagga 11460

ttttgcaata ctttcattca tacacactca ctaagtttta cacgattata atttcttcat 11520 

agccagcaga tctgccggca tcgatcccgg gccatggcct gctttaatga gatatgcgag 11580

acgcctatga tcgcatgata tttgctttca attctgttgt gcacgttgta aaaaacctga 11640
gcatgtgtag ctcagatcct taccgccggt ttcggttcat tctaatgaat atatcacccg 11700 
ttactatcgt atttttatga ataatattct ccgttcaatt tactgattgt ccgtcgacga 11760 
gctcggcgcg cctctagagg atcgatgaat tcagatcggc tgagtggctc cttcaacgtt 11820
gcggttctgt cagttccaaa cgtaaaacgg cttgtcccgc gtcatcggcg ggggtcataa 11880 
cgtgactccc ttaattctcc gctcatgatc agattgtcgt ttcccgcctt cagtttaaac 11940 
tatcagtgtt tgacaggata tattggcggg taaacctaag agaaaagagc gtttattaga 12000 
ataatcggat atttaaaagg gcgtgaaaag gtttatcctt cgtccatttg tatgtgcatg 12060 
ccaaccacag ggttcccca 12079

<210> 41
<211> 13002
<212> DNA
<213> Unknown

<220>

<223> Plant expression vector with two
promoter-terminator expression cassettes
<400> 41 

gatctggcgc cggccagcga gacgagcaag attggccgcc gcccgaaacg atccgacagc 60 

gcgcccagca caggtgcgca ggcaaattgc accaacgcat acagcgccag cagaatgcca 120 

tagtgggcgg tgacgtcgtt cgagtgaacc agatcgcgca ggaggcccgg cagcaccggc 180 

ataatcaggc cgatgccgac agcgtcgagc gcgacagtgc tcagaattac gatcaggggt 240

atgttgggtt tcacgtctgg cctccggacc agcctccgct ggtccgattg aacgcgcgga 300

ttctttatca ctgataagtt ggtggacata ttatgtttat cagtgataaa gtgtcaagca 360 

tgacaaagtt gcagccgaat acagtgatcc gtgccgccct ggacctgttg aacgaggtcg 420 

gcgtagacgg tctgacgaca cgcaaactgg cggaacggtt gggggttcag cagccggcgc 480 

tttactggca cttcaggaac aagcgggcgc tgctcgacgc actggccgaa gccatgctgg 540

cggagaatca tacgcattcg gtgccgagag ccgacgacga ctggcgctca tttctgatcg 600 

ggaatgcccg cagcttcagg caggcgctgc tcgcctaccg cgatggcgcg cgcatccatg 660 

ccggcacgcg accgggcgca ccgcagatgg aaacggccga cgcgcagctt cgcttcctct 720 

gcgaggcggg tttttcggcc ggggacgccg tcaatgcgct gatgacaatc agctacttca 780 

ctgttggggc cgtgcttgag gagcaggccg gcgacagcga tgccggcgag cgcggcggca 840 

ccgttgaaca ggctccgctc tcgccgctgt tgcgggccgc gatagacgcc ttcgacgaag 900 

ccggtccgga cgcagcgttc gagcagggac tcgcggtgat tgtcgatgga ttggcgaaaa 960 

ggaggctcgt tgtcaggaac gttgaaggac cgagaaaggg tgacgattga tcaggaccgc 1020 

tgccggagcg caacccactc actacagcag agccatgtag acaacatccc ctcccccttt 1080 

ccaccgcgtc agacgcccgt agcagcccgc tacgggcttt ttcatgccct gccctagcgt 1140
ccaagcctca cggccgcgct cggcctctct ggcggccttc tggcgctctt ccgcttcctc 1200
gctcactgac tcgctgcgct cggtcgttcg gctgcggcga gcggtatcag ctcactcaaa 1260 
ggcggtaata cggttatcca cagaatcagg ggataacgca ggaaagaaca tgtgagcaaa 1320 
aggccagcaa aaggccagga accgtaaaaa ggccgcgttg ctggcgtttt tccataggct 1380 
ccgcccccct gacgagcatc acaaaaatcg acgctcaagt cagaggtggc gaaacccgac 1440 
aggactataa agataccagg cgtttccccc tggaagctcc ctcgtgcgct ctcctgttcc 1500 
gaccctgccg cttaccggat acctgtccgc ctttctccct tcgggaagcg tggcgctttt 1560 
ccgctgcata accctgcttc ggggtcatta tagcgatttt ttcggtatat ccatcctttt 1620
tcgcacgata tacaggattt tgccaaaggg ttcgtgtaga ctttccttgg tgtatccaac 1680 
ggcgtcagcc gggcaggata ggtgaagtag gcccacccgc gagcgggtgt tccttcttca 1740 
ctgtccctta ttcgcacctg gcggtgctca acgggaatcc tgctctgcga ggctggccgg 1800 
ctaccgccgg cgtaacagat gagggcaagc ggatggctga tgaaaccaag ccaaccagga 1860
agggcagccc acctatcaag gtgtactgcc ttccagacga acgaagagcg attgaggaaa 1920 
aggcggcggc ggccggcatg agcctgtcgg cctacctgct ggccgtcggc cagggctaca 1980 
aaatcacggg cgtcgtggac tatgagcacg tccgcgagct ggcccgcatc aatggcgacc 2040 
tgggccgcct gggcggcctg ctgaaactct ggctcaccga cgacccgcgc acggcgcggt 2100 
tcggtgatgc cacgatcctc gccctgctgg cgaagatcga agagaagcag gacgagcttg 2160 
gcaaggtcat gatgggcgtg gtccgcccga gggcagagcc atgacttttt tagccgctaa 2220 
aacggccggg gggtgcgcgt gattgccaag cacgtcccca tgcgctccat caagaagagc 2280 
gacttcgcgg agctggtgaa gtacatcacc gacgagcaag gcaagaccga gcgcctttgc 2340
gacgctcacc gggctggttg ccctcgccgc tgggctggcg gccgtctatg gccctgcaaa 2400
cgcgccagaa acgccgtcga agccgtgtgc gagacaccgc ggccgccggc gttgtggata 2460 
cctcgcggaa aacttggccc tcactgacag atgaggggcg gacgttgaca cttgaggggc 2520 
cgactcaccc ggcgcggcgt tgacagatga ggggcaggct cgatttcggc cggcgacgtg 2580
gagctggcca gcctcgcaaa tcggcgaaaa cgcctgattt tacgcgagtt tcccacagat 2640 
gatgtggaca agcctgggga taagtgccct gcggtattga cacttgaggg gcgcgactac 2700 
tgacagatga ggggcgcgat ccttgacact tgaggggcag agtgctgaca gatgaggggc 2760 
gcacctattg acatttgagg ggctgtccac aggcagaaaa tccagcattt gcaagggttt 2820 
ccgcccgttt ttcggccacc gctaacctgt cttttaacct gcttttaaac caatatttat 2880 
aaaccttgtt tttaaccagg gctgcgccct gtgcgcgtga ccgcgcacgc cgaagggggg 2940 
tgccccccct tctcgaaccc tcccggcccg ctaacgcggg cctcccatcc ccccaggggc 3000 
tgcgcccctc ggccgcgaac ggcctcaccc caaaaatggc agcgctggca gtccttgcca 3060 
ttgccgggat cggggcagta acgggatggg cgatcagccc gagcgcgacg cccggaagca 3120 
ttgacgtgcc gcaggtgctg gcatcgacat tcagcgacca ggtgccgggc agtgagggcg 3180 
gcggcctggg tggcggcctg cccttcactt cggccgtcgg ggcattcacg gacttcatgg 3240 
cggggccggc aatttttacc ttgggcattc ttggcatagt ggtcgcgggt gccgtgctcg 3300 
tgttcggggg tgcgataaac ccagcgaacc atttgaggtg ataggtaaga ttataccgag 3360 
gtatgaaaac gagaattgga cctttacaga attactctat gaagcgccat atttaaaaag 3420 
ctaccaagac gaagaggatg aagaggatga ggaggcagat tgccttgaat atattgacaa 3480 
tactgataag ataatatatc ttttatatag aagatatcgc cgtatgtaag gatttcaggg 3540
ggcaaggcat aggcagcgcg cttatcaata tatctataga atgggcaaag cataaaaact 3600
tgcatggact aatgcttgaa acccaggaca ataaccttat agcttgtaaa ttctatcata 3660 
attgggtaat gactccaact tattgatagt gttttatgtt cagataatgc ccgatgactt 3720 
tgtcatgcag ctccaccgat tttgagaacg acagcgactt ccgtcccagc cgtgccaggt 3780 
gctgcctcag attcaggtta tgccgctcaa ttcgctgcgt atatcgcttg ctgattacgt 3840 
gcagctttcc cttcaggcgg gattcataca gcggccagcc atccgtcatc catatcacca 3900 
cgtcaaaggg tgacagcagg ctcataagac gccccagcgt cgccatagtg cgttcaccga 3960 
atacgtgcgc aacaaccgtc ttccggagac tgtcatacgc gtaaaacagc cagcgctggc 4020 
gcgatttagc cccgacatag ccccactgtt cgtccatttc cgcgcagacg atgacgtcac 4080 
tgcccggctg tatgcgcgag gttaccgact gcggcctgag ttttttaagt gacgtaaaat 4140 
cgtgttgagg ccaacgccca taatgcgggc tgttgcccgg catccaacgc cattcatggc 4200 
catatcaatg attttctggt gcgtaccggg ttgagaagcg gtgtaagtga actgcagttg 4260
ccatgtttta cggcagtgag agcagagata gcgctgatgt ccggcggtgc ttttgccgtt 4320 
acgcaccacc ccgtcagtag ctgaacagga gggacagctg atagacacag aagccactgg 4380 
agcacctcaa aaacaccatc atacactaaa tcagtaagtt ggcagcatca cccataattg 4440 
tggtttcaaa atcggctccg tcgatactat gttatacgcc aactttgaaa acaactttga 4500 
aaaagctgtt ttctggtatt taaggtttta gaatgcaagg aacagtgaat tggagttcgt 4560 
cttgttataa ttagcttctt ggggtatctt taaatactgt agaaaagagg aaggaaataa 4620 
taaatggcta aaatgagaat atcaccggaa ttgaaaaaac tgatcgaaaa ataccgctgc 4680 
gtaaaagata cggaaggaat gtctcctgct aaggtatata agctggtggg agaaaatgaa 4740
aacctatatt taaaaatgac ggacagccgg tataaaggga ccacctatga tgtggaacgg 4800
gaaaaggaca tgatgctatg gctggaagga aagctgcctg ttccaaaggt cctgcacttt 4860 
gaacggcatg atggctggag caatctgctc atgagtgagg ccgatggcgt cctttgctcg 4920 
gaagagtatg aagatgaaca aagccctgaa aagattatcg agctgtatgc ggagtgcatc 4980 
aggctctttc actccatcga catatcggat tgtccctata cgaatagctt agacagccgc 5040 
ttagccgaat tggattactt actgaataac gatctggccg atgtggattg cgaaaactgg 5100 
gaagaagaca ctccatttaa agatccgcgc gagctgtatg attttttaaa gacggaaaag 5160 
cccgaagagg aacttgtctt ttcccacggc gacctgggag acagcaacat ctttgtgaaa 5220 
gatggcaaag taagtggctt tattgatctt gggagaagcg gcagggcgga caagtggtat 5280 
gacattgcct tctgcgtccg gtcgatcagg gaggatatcg gggaagaaca gtatgtcgag 5340 
ctattttttg acttactggg gatcaagcct gattgggaga aaataaaata ttatatttta 5400 
ctggatgaat tgttttagta cctagatgtg gcgcaacgat gccggcgaca agcaggagcg 5460
caccgacttc ttccgcatca agtgttttgg ctctcaggcc gaggcccacg gcaagtattt 5520 
gggcaagggg tcgctggtat tcgtgcaggg caagattcgg aataccaagt acgagaagga 5580 
cggccagacg gtctacggga ccgacttcat tgccgataag gtggattatc tggacaccaa 5640 
ggcaccaggc gggtcaaatc aggaataagg gcacattgcc ccggcgtgag tcggggcaat 5700 
cccgcaagga gggtgaatga atcggacgtt tgaccggaag gcatacaggc aagaactgat 5760 
cgacgcgggg ttttccgccg aggatgccga aaccatcgca agccgcaccg tcatgcgtgc 5820 
gccccgcgaa accttccagt ccgtcggctc gatggtccag caagctacgg ccaagatcga 5880 
gcgcgacagc gtgcaactgg ctccccctgc cctgcccgcg ccatcggccg ccgtggagcg 5940
ttcgcgtcgt ctcgaacagg aggcggcagg tttggcgaag tcgatgacca tcgacacgcg 6000 
aggaactatg acgaccaaga agcgaaaaac cgccggcgag gacctggcaa aacaggtcag 6060 
cgaggccaag caggccgcgt tgctgaaaca cacgaagcag cagatcaagg aaatgcagct 6120 
ttccttgttc gatattgcgc cgtggccgga cacgatgcga gcgatgccaa acgacacggc 6180 
ccgctctgcc ctgttcacca cgcgcaacaa gaaaatcccg cgcgaggcgc tgcaaaacaa 6240 
ggtcattttc cacgtcaaca aggacgtgaa gatcacctac accggcgtcg agctgcgggc 6300 
cgacgatgac gaactggtgt ggcagcaggt gttggagtac gcgaagcgca cccctatcgg 6360 
cgagccgatc accttcacgt tctacgagct ttgccaggac ctgggctggt cgatcaatgg 6420 
ccggtattac acgaaggccg aggaatgcct gtcgcgccta caggcgacgg cgatgggctt 6480
cacgtccgac cgcgttgggc acctggaatc ggtgtcgctg ctgcaccgct tccgcgtcct 6540 
ggaccgtggc aagaaaacgt cccgttgcca ggtcctgatc gacgaggaaa tcgtcgtgct 6600 
gtttgctggc gaccactaca cgaaattcat atgggagaag taccgcaagc tgtcgccgac 6660
ggcccgacgg atgttcgact atttcagctc gcaccgggag ccgtacccgc tcaagctgga 6720 
aaccttccgc ctcatgtgcg gatcggattc cacccgcgtg aagaagtggc gcgagcaggt 6780 
cggcgaagcc tgcgaagagt tgcgaggcag cggcctggtg gaacacgcct gggtcaatga 6840
tgacctggtg cattgcaaac gctagggcct tgtggggtca gttccggctg ggggttcagc 6900 
agccagcgct ttactggcat ttcaggaaca agcgggcact gctcgacgca cttgcttcgc 6960 
tcagtatcgc tcgggacgca cggcgcgctc tacgaactgc cgataaacag aggattaaaa 7020 
ttgacaattg tgattaaggc tcagattcga cggcttggag cggccgacgt gcaggatttc 7080 
cgcgagatcc gattgtcggc cctgaagaaa gctccagaga tgttcgggtc cgtttacgag 7140
cacgaggaga aaaagcccat ggaggcgttc gctgaacggt tgcgagatgc cgtggcattc 7200
ggcgcctaca tcgacggcga gatcattggg ctgtcggtct tcaaacagga ggacggcccc 7260
aaggacgctc acaaggcgca tctgtccggc gttttcgtgg agcccgaaca gcgaggccga 7320
ggggtcgccg gtatgctgct gcgggcgttg ccggcgggtt tattgctcgt gatgatcgtc 7380 
cgacagattc caacgggaat ctggtggatg cgcatcttca tcctcggcgc acttaatatt 7440 
tcgctattct ggagcttgtt gtttatttcg gtctaccgcc tgccgggcgg ggtcgcggcg 7500 
acggtaggcg ctgtgcagcc gctgatggtc gtgttcatct ctgccgctct gctaggtagc 7560 
ccgatacgat tgatggcggt cctgggggct atttgcggaa ctgcgggcgt ggcgctgttg 7620
gtgttgacac caaacgcagc gctagatcct gtcggcgtcg cagcgggcct ggcgggggcg 7680 
gtttccatgg cgttcggaac cgtgctgacc cgcaagtggc aacctcccgt gcctctgctc 7740 
acctttaccg cctggcaact ggcggccgga ggacttctgc tcgttccagt agctttagtg 7800 
tttgatccgc caatcccgat gcctacagga accaatgttc tcggcctggc gtggctcggc 7860
ctgatcggag cgggtttaac ctacttcctt tggttccggg ggatctcgcg actcgaacct 7920
acagttgttt ccttactggg ctttctcagc cccagatctg gggtcgatca gccggggatg 7980
catcaggccg acagtcggaa cttcgggtcc ccgacctgta ccattcggtg agcaatggat 8040 
aggggagttg atatcgtcaa cgttcacttc taaagaaata gcgccactca gcttcctcag 8100 
cggctttatc cagcgatttc ctattatgtc ggcatagttc tcaagatcga cagcctgtca 8160 
cggttaagcg agaaatgaat aagaaggctg ataattcgga tctctgcgag ggagatgata 8220 
tttgatcaca ggcagcaacg ctctgtcatc gttacaatca acatgctacc ctccgcgaga 8280 
tcatccgtgt ttcaaacccg gcagcttagt tgccgttctt ccgaatagca tcggtaacat 8340
gagcaaagtc tgccgcctta caacggctct cccgctgacg ccgtcccgga ctgatgggct 8400
gcctgtatcg agtggtgatt ttgtgccgag ctgccggtcg gggagctgtt ggctggctgg 8460 
tggcaggata tattgtggtg taaacaaatt gacgcttaga caacttaata acacattgcg 8520 
gacgttttta atgtactggg gtggtttttc ttttcaccag tgagacgggc aacagctgat 8580
tgcccttcac cgcctggccc tgagagagtt gcagcaagcg gtccacgctg gtttgcccca 8640
gcaggcgaaa atcctgtttg atggtggttc cgaaatcggc aaaatccctt ataaatcaaa 8700 
agaatagccc gagatagggt tgagtgttgt tccagtttgg aacaagagtc cactattaaa 8760 
gaacgtggac tccaacgtca aagggcgaaa aaccgtctat cagggcgatg gcccactacg 8820 
tgaaccatca cccaaatcaa gttttttggg gtcgaggtgc cgtaaagcac taaatcggaa 8880 
ccctaaaggg agcccccgat ttagagcttg acggggaaag ccggcgaacg tggcgagaaa 8940 

ggaagggaag aaagcgaaag gagcgggcgc cattcaggct gcgcaactgt tgggaagggc 9000 
gatcggtgcg ggcctcttcg ctattacgcc agctggcgaa agggggatgt gctgcaaggc 9060
gattaagttg ggtaacgcca gggttttccc agtcacgacg ttgtaaaacg acggccagtg 9120 
aattaattcc catcttgaaa gaaatatagt ttaaatattt attgataaaa taacaagtca 9180 
ggtattatag tccaagcaaa aacataaatt tattgatgca agtttaaatt cagaaatatt 9240 
tcaataactg attatatcag ctggtacatt gccgtagatg aaagactgag tgcgatatta 9300 
tgtgtaatac ataaattgat gatatagcta gcttagctca tcgggggatc cgtcgaagct 9360 
agcttgggtc ccgctcagaa gaactcgtca agaaggcgat agaaggcgat gcgctgcgaa 9420 
tcgggagcgg cgataccgta aagcacgagg aagcggtcag cccattcgcc gccaagctct 9480 
tcagcaatat cacgggtagc caacgctatg tcctgatagc ggtccgccac acccagccgg 9540
ccacagtcga tgaatccaga aaagcggcca ttttccacca tgatattcgg caagcaggca 9600
tcgccatggg tcacgacgag atcctcgccg tcgggcatgc gcgccttgag cctggcgaac 9660 
agttcggctg gcgcgagccc ctgatgctct tcgtccagat catcctgatc gacaagaccg 9720 
gcttccatcc gagtacgtgc tcgctcgatg cgatgtttcg cttggtggtc gaatgggcag 9780 
gtagccggat caagcgtatg cagccgccgc attgcatcag ccatgatgga tactttctcg 9840 
gcaggagcaa ggtgagatga caggagatcc tgccccggca cttcgcccaa tagcagccag 9900 
tcccttcccg cttcagtgac aacgtcgagc acagctgcgc aaggaacgcc cgtcgtggcc 9960 
agccacgata gccgcgctgc ctcgtcctgc agttcattca gggcaccgga caggtcggtc 10020
ttgacaaaaa gaaccgggcg cccctgcgct gacagccgga acacggcggc atcagagcag 10080
ccgattgtct gttgtgccca gtcatagccg aatagcctct ccacccaagc ggccggagaa 10140
cctgcgtgca atccatcttg ttcaatccaa gctcccatgg gccctcgact agagtcgaga 10200
tctggattga gagtgaatat gagactctaa ttggataccg aggggaattt atggaacgtc 10260
agtggagcat ttttgacaag aaatatttgc tagctgatag tgaccttagg cgacttttga 10320
acgcgcaata atggtttctg acgtatgtgc ttagctcatt aaactccaga aacccgcggc 10380
tgagtggctc cttcaacgtt gcggttctgt cagttccaaa cgtaaaacgg cttgtcccgc 10440
gtcatcggcg ggggtcataa cgtgactccc ttaattctcc gctcatgatc ttgatcccct 10500 
gcgccatcag atccttggcg gcaagaaagc catccagttt actttgcagg gcttcccaac 10560 
cttaccagag ggcgccccag ctggcaattc cggttcgctt gctgtccata aaaccgccca 10620 
gtctagctat cgccatgtaa gcccactgca agctacctgc tttctctttg cgcttgcgtt 10680 
ttcccttgtc cagatagccc agtagctgac attcatccgg ggtcagcacc gtttctgcgg 10740
actggctttc tacgtgttcc gcttccttta gcagcccttg cgccctgagt gcttgcggca 10800
gcgtgaagct tgcatgcctg caggtcgacg gcgcgccgag ctcctcgagc aaatttacac 10860
attgccacta aacgtctaaa cccttgtaat ttgtttttgt tttactatgt gtgttatgta 10920
tttgatttgc gataaatttt tatatttggt actaaattta taacaccttt tatgctaacg 10980
tttgccaaca cttagcaatt tgcaagttga ttaattgatt ctaaattatt tttgtcttct 11040
aaatacatat actaatcaac tggaaatgta aatatttgct aatatttcta ctataggaga 11100
attaaagtga gtgaatatgg taccacaagg tttggagatt taattgttgc aatgctgcat 11160
ggatggcata tacaccaaac attcaataat tcttgaggat aataatggta ccacacaaga 11220 
tttgaggtgc atgaacgtca cgtggacaaa aggtttagta atttttcaag acaacaatgt 11280 
taccacacac aagttttgag gtgcatgcat ggatgccctg tggaaagttt aaaaatattt 11340
tggaaatgat ttgcatggaa gccatgtgta aaaccatgac atccacttgg aggatgcaat 11400
aatgaagaaa actacaaatt tacatgcaac tagttatgca tgtagtctat ataatgagga 11460
ttttgcaata ctttcattca tacacactca ctaagtttta cacgattata atttcttcat 11520 
agccagccca ccgcggtggg cggccgcctg cagtctagaa ggcctcctgc tttaatgaga 11580 
tatgcgagac gcctatgatc gcatgatatt tgctttcaat tctgttgtgc acgttgtaaa 11640 
aaacctgagc atgtgtagct cagatcctta ccgccggttt cggttcattc taatgaatat 11700 
atcacccgtt actatcgtat ttttatgaat aatattctcc gttcaattta ctgattgtcc 11760 
gtcgagcaaa tttacacatt gccactaaac gtctaaaccc ttgtaatttg tttttgtttt 11820 
actatgtgtg ttatgtattt gatttgcgat aaatttttat atttggtact aaatttataa 11880
caccttttat gctaacgttt gccaacactt agcaatttgc aagttgatta attgattcta 11940
aattattttt gtcttctaaa tacatatact aatcaactgg aaatgtaaat atttgctaat 12000
atttctacta taggagaatt aaagtgagtg aatatggtac cacaaggttt ggagatttaa 12060 
ttgttgcaat gctgcatgga tggcatatac accaaacatt caataattct tgaggataat 12120 
aatggtacca cacaagattt gaggtgcatg aacgtcacgt ggacaaaagg tttagtaatt 12180 
tttcaagaca acaatgttac cacacacaag ttttgaggtg catgcatgga tgccctgtgg 12240
aaagtttaaa aatattttgg aaatgatttg catggaagcc atgtgtaaaa ccatgacatc 12300 
cacttggagg atgcaataat gaagaaaact acaaatttac atgcaactag ttatgcatgt 12360 
agtctatata atgaggattt tgcaatactt tcattcatac acactcacta agttttacac 12420
gattataatt tcttcatagc cagcggatcc gatatcgggc ccgctagcgt taaccctgct 12480 
ttaatgagat atgcgagacg cctatgatcg catgatattt gctttcaatt ctgttgtgca 12540
cgttgtaaaa aacctgagca tgtgtagctc agatccttac cgccggtttc ggttcattct 12600
aatgaatata tcacccgtta ctatcgtatt tttatgaata atattctccg ttcaatttac 12660
tgattgtccg tcgacgaatt cgagctcggc gcgcctctag aggatcgatg aattcagatc 12720 
ggctgagtgg ctccttcaac gttgcggttc tgtcagttcc aaacgtaaaa cggcttgtcc 12780 
cgcgtcatcg gcgggggtca taacgtgact cccttaattc tccgctcatg atcagattgt 12840
cgtttcccgc cttcagttta aactatcagt gtttgacagg atatattggc gggtaaacct 12900 
aagagaaaag agcgtttatt agaataatcg gatatttaaa agggcgtgaa aaggtttatc 12960 
cttcgtccat ttgtatgtgc atgccaacca cagggttccc ca 13002

<210> 42
<211> 13905
<212> DNA
<213> Unknown 

<220> 

<223> Plant expression vector with three 

promoter-terminator expression cassettes 

<400> 42 

gatctggcgc cggccagcga gacgagcaag attggccgcc gcccgaaacg atccgacagc 60 
gcgcccagca caggtgcgca ggcaaattgc accaacgcat acagcgccag cagaatgcca 120 
tagtgggcgg tgacgtcgtt cgagtgaacc agatcgcgca ggaggcccgg cagcaccggc 180 
ataatcaggc cgatgccgac agcgtcgagc gcgacagtgc tcagaattac gatcaggggt 240
atgttgggtt tcacgtctgg cctccggacc agcctccgct ggtccgattg aacgcgcgga 300 
ttctttatca ctgataagtt ggtggacata ttatgtttat cagtgataaa gtgtcaagca 360 
tgacaaagtt gcagccgaat acagtgatcc gtgccgccct ggacctgttg aacgaggtcg 420
gcgtagacgg tctgacgaca cgcaaactgg cggaacggtt gggggttcag cagccggcgc 480 
tttactggca cttcaggaac aagcgggcgc tgctcgacgc actggccgaa gccatgctgg 540 
cggagaatca tacgcattcg gtgccgagag ccgacgacga ctggcgctca tttctgatcg 600 
ggaatgcccg cagcttcagg caggcgctgc tcgcctaccg cgatggcgcg cgcatccatg 660 
ccggcacgcg accgggcgca ccgcagatgg aaacggccga cgcgcagctt cgcttcctct 720 
gcgaggcggg tttttcggcc ggggacgccg tcaatgcgct gatgacaatc agctacttca 780 
ctgttggggc cgtgcttgag gagcaggccg gcgacagcga tgccggcgag cgcggcggca 840 
ccgttgaaca ggctccgctc tcgccgctgt tgcgggccgc gatagacgcc ttcgacgaag 900 
ccggtccgga cgcagcgttc gagcagggac tcgcggtgat tgtcgatgga ttggcgaaaa 960
ggaggctcgt tgtcaggaac gttgaaggac cgagaaaggg tgacgattga tcaggaccgc 1020
tgccggagcg caacccactc actacagcag agccatgtag acaacatccc ctcccccttt 1080
ccaccgcgtc agacgcccgt agcagcccgc tacgggcttt ttcatgccct gccctagcgt 1140
ccaagcctca cggccgcgct cggcctctct ggcggccttc tggcgctctt ccgcttcctc 1200 
gctcactgac tcgctgcgct cggtcgttcg gctgcggcga gcggtatcag ctcactcaaa 1260 
ggcggtaata cggttatcca cagaatcagg ggataacgca ggaaagaaca tgtgagcaaa 1320 
aggccagcaa aaggccagga accgtaaaaa ggccgcgttg ctggcgtttt tccataggct 1380 
ccgcccccct gacgagcatc acaaaaatcg acgctcaagt cagaggtggc gaaacccgac 1440 

aggactataa agataccagg cgtttccccc tggaagctcc ctcgtgcgct ctcctgttcc 1500 
gaccctgccg cttaccggat acctgtccgc ctttctccct tcgggaagcg tggcgctttt 1560 
ccgctgcata accctgcttc ggggtcatta tagcgatttt ttcggtatat ccatcctttt 1620
tcgcacgata tacaggattt tgccaaaggg ttcgtgtaga ctttccttgg tgtatccaac 1680
ggcgtcagcc gggcaggata ggtgaagtag gcccacccgc gagcgggtgt tccttcttca 1740
ctgtccctta ttcgcacctg gcggtgctca acgggaatcc tgctctgcga ggctggccgg 1800 
ctaccgccgg cgtaacagat gagggcaagc ggatggctga tgaaaccaag ccaaccagga 1860 
agggcagccc acctatcaag gtgtactgcc ttccagacga acgaagagcg attgaggaaa 1920 
aggcggcggc ggccggcatg agcctgtcgg cctacctgct ggccgtcggc cagggctaca 1980 
aaatcacggg cgtcgtggac tatgagcacg tccgcgagct ggcccgcatc aatggcgacc 2040 
tgggccgcct gggcggcctg ctgaaactct ggctcaccga cgacccgcgc acggcgcggt 2100 
tcggtgatgc cacgatcctc gccctgctgg cgaagatcga agagaagcag gacgagcttg 2160
gcaaggtcat gatgggcgtg gtccgcccga gggcagagcc atgacttttt tagccgctaa 2220
aacggccggg gggtgcgcgt gattgccaag cacgtcccca tgcgctccat caagaagagc 2280 
gacttcgcgg agctggtgaa gtacatcacc gacgagcaag gcaagaccga gcgcctttgc 2340 
gacgctcacc gggctggttg ccctcgccgc tgggctggcg gccgtctatg gccctgcaaa 2400 
cgcgccagaa acgccgtcga agccgtgtgc gagacaccgc ggccgccggc gttgtggata 2460 
cctcgcggaa aacttggccc tcactgacag atgaggggcg gacgttgaca cttgaggggc 2520 
cgactcaccc ggcgcggcgt tgacagatga ggggcaggct cgatttcggc cggcgacgtg 2580 

gagctggcca gcctcgcaaa tcggcgaaaa cgcctgattt tacgcgagtt tcccacagat 2640 
gatgtggaca agcctgggga taagtgccct gcggtattga cacttgaggg gcgcgactac 2700 
tgacagatga ggggcgcgat ccttgacact tgaggggcag agtgctgaca gatgaggggc 2760 
gcacctattg acatttgagg ggctgtccac aggcagaaaa tccagcattt gcaagggttt 2820
ccgcccgttt ttcggccacc gctaacctgt cttttaacct gcttttaaac caatatttat 2880
aaaccttgtt tttaaccagg gctgcgccct gtgcgcgtga ccgcgcacgc cgaagggggg 2940 
tgccccccct tctcgaaccc tcccggcccg ctaacgcggg cctcccatcc ccccaggggc 3000 
tgcgcccctc ggccgcgaac ggcctcaccc caaaaatggc agcgctggca gtccttgcca 3060 
ttgccgggat cggggcagta acgggatggg cgatcagccc gagcgcgacg cccggaagca 3120 
ttgacgtgcc gcaggtgctg gcatcgacat tcagcgacca ggtgccgggc agtgagggcg 3180 
gcggcctggg tggcggcctg cccttcactt cggccgtcgg ggcattcacg gacttcatgg 3240 
cggggccggc aatttttacc ttgggcattc ttggcatagt ggtcgcgggt gccgtgctcg 3300 
tgttcggggg tgcgataaac ccagcgaacc atttgaggtg ataggtaaga ttataccgag 3360
gtatgaaaac gagaattgga cctttacaga attactctat gaagcgccat atttaaaaag 3420
ctaccaagac gaagaggatg aagaggatga ggaggcagat tgccttgaat atattgacaa 3480
tactgataag ataatatatc ttttatatag aagatatcgc cgtatgtaag gatttcaggg 3540
ggcaaggcat aggcagcgcg cttatcaata tatctataga atgggcaaag cataaaaact 3600
tgcatggact aatgcttgaa acccaggaca ataaccttat agcttgtaaa ttctatcata 3660 
attgggtaat gactccaact tattgatagt gttttatgtt cagataatgc ccgatgactt 3720 
tgtcatgcag ctccaccgat tttgagaacg acagcgactt ccgtcccagc cgtgccaggt 3780
gctgcctcag attcaggtta tgccgctcaa ttcgctgcgt atatcgcttg ctgattacgt 3840 
gcagctttcc cttcaggcgg gattcataca gcggccagcc atccgtcatc catatcacca 3900 
cgtcaaaggg tgacagcagg ctcataagac gccccagcgt cgccatagtg cgttcaccga 3960 
atacgtgcgc aacaaccgtc ttccggagac tgtcatacgc gtaaaacagc cagcgctggc 4020
gcgatttagc cccgacatag ccccactgtt cgtccatttc cgcgcagacg atgacgtcac 4080 
tgcccggctg tatgcgcgag gttaccgact gcggcctgag ttttttaagt gacgtaaaat 4140 
cgtgttgagg ccaacgccca taatgcgggc tgttgcccgg catccaacgc cattcatggc 4200
catatcaatg attttctggt gcgtaccggg ttgagaagcg gtgtaagtga actgcagttg 4260
ccatgtttta cggcagtgag agcagagata gcgctgatgt ccggcggtgc ttttgccgtt 4320 
acgcaccacc ccgtcagtag ctgaacagga gggacagctg atagacacag aagccactgg 4380 
agcacctcaa aaacaccatc atacactaaa tcagtaagtt ggcagcatca cccataattg 4440 
tggtttcaaa atcggctccg tcgatactat gttatacgcc aactttgaaa acaactttga 4500 
aaaagctgtt ttctggtatt taaggtttta gaatgcaagg aacagtgaat tggagttcgt 4560
cttgttataa ttagcttctt ggggtatctt taaatactgt agaaaagagg aaggaaataa 4620
taaatggcta aaatgagaat atcaccggaa ttgaaaaaac tgatcgaaaa ataccgctgc 4680 
gtaaaagata cggaaggaat gtctcctgct aaggtatata agctggtggg agaaaatgaa 4740 
aacctatatt taaaaatgac ggacagccgg tataaaggga ccacctatga tgtggaacgg 4800 
gaaaaggaca tgatgctatg gctggaagga aagctgcctg ttccaaaggt cctgcacttt 4860 
gaacggcatg atggctggag caatctgctc atgagtgagg ccgatggcgt cctttgctcg 4920 
gaagagtatg aagatgaaca aagccctgaa aagattatcg agctgtatgc ggagtgcatc 4980 
aggctctttc actccatcga catatcggat tgtccctata cgaatagctt agacagccgc 5040
ttagccgaat tggattactt actgaataac gatctggccg atgtggattg cgaaaactgg 5100 
gaagaagaca ctccatttaa agatccgcgc gagctgtatg attttttaaa gacggaaaag 5160 
cccgaagagg aacttgtctt ttcccacggc gacctgggag acagcaacat ctttgtgaaa 5220 
gatggcaaag taagtggctt tattgatctt gggagaagcg gcagggcgga caagtggtat 5280 
gacattgcct tctgcgtccg gtcgatcagg gaggatatcg gggaagaaca gtatgtcgag 5340 
ctattttttg acttactggg gatcaagcct gattgggaga aaataaaata ttatatttta 5400 
ctggatgaat tgttttagta cctagatgtg gcgcaacgat gccggcgaca agcaggagcg 5460 
caccgacttc ttccgcatca agtgttttgg ctctcaggcc gaggcccacg gcaagtattt 5520 
gggcaagggg tcgctggtat tcgtgcaggg caagattcgg aataccaagt acgagaagga 5580 
cggccagacg gtctacggga ccgacttcat tgccgataag gtggattatc tggacaccaa 5640 
ggcaccaggc gggtcaaatc aggaataagg gcacattgcc ccggcgtgag tcggggcaat 5700 
cccgcaagga gggtgaatga atcggacgtt tgaccggaag gcatacaggc aagaactgat 5760
cgacgcgggg ttttccgccg aggatgccga aaccatcgca agccgcaccg tcatgcgtgc 5820
gccccgcgaa accttccagt ccgtcggctc gatggtccag caagctacgg ccaagatcga 5880 
gcgcgacagc gtgcaactgg ctccccctgc cctgcccgcg ccatcggccg ccgtggagcg 5940 
ttcgcgtcgt ctcgaacagg aggcggcagg tttggcgaag tcgatgacca tcgacacgcg 6000 
aggaactatg acgaccaaga agcgaaaaac cgccggcgag gacctggcaa aacaggtcag 6060 
cgaggccaag caggccgcgt tgctgaaaca cacgaagcag cagatcaagg aaatgcagct 6120 
ttccttgttc gatattgcgc cgtggccgga cacgatgcga gcgatgccaa acgacacggc 6180 
ccgctctgcc ctgttcacca cgcgcaacaa gaaaatcccg cgcgaggcgc tgcaaaacaa 6240 
ggtcattttc cacgtcaaca aggacgtgaa gatcacctac accggcgtcg agctgcgggc 6300 
cgacgatgac gaactggtgt ggcagcaggt gttggagtac gcgaagcgca cccctatcgg 6360
cgagccgatc accttcacgt tctacgagct ttgccaggac ctgggctggt cgatcaatgg 6420
ccggtattac acgaaggccg aggaatgcct gtcgcgccta caggcgacgg cgatgggctt 6480 
cacgtccgac cgcgttgggc acctggaatc ggtgtcgctg ctgcaccgct tccgcgtcct 6540 
ggaccgtggc aagaaaacgt cccgttgcca ggtcctgatc gacgaggaaa tcgtcgtgct 6600 
gtttgctggc gaccactaca cgaaattcat atgggagaag taccgcaagc tgtcgccgac 6660 
ggcccgacgg atgttcgact atttcagctc gcaccgggag ccgtacccgc tcaagctgga 6720 
aaccttccgc ctcatgtgcg gatcggattc cacccgcgtg aagaagtggc gcgagcaggt 6780 
cggcgaagcc tgcgaagagt tgcgaggcag cggcctggtg gaacacgcct gggtcaatga 6840 
tgacctggtg cattgcaaac gctagggcct tgtggggtca gttccggctg ggggttcagc 6900 
agccagcgct ttactggcat ttcaggaaca agcgggcact gctcgacgca cttgcttcgc 6960
tcagtatcgc tcgggacgca cggcgcgctc tacgaactgc cgataaacag aggattaaaa 7020
ttgacaattg tgattaaggc tcagattcga cggcttggag cggccgacgt gcaggatttc 7080
cgcgagatcc gattgtcggc cctgaagaaa gctccagaga tgttcgggtc cgtttacgag 7140
cacgaggaga aaaagcccat ggaggcgttc gctgaacggt tgcgagatgc cgtggcattc 7200
ggcgcctaca tcgacggcga gatcattggg ctgtcggtct tcaaacagga ggacggcccc 7260
aaggacgctc acaaggcgca tctgtccggc gttttcgtgg agcccgaaca gcgaggccga 7320
ggggtcgccg gtatgctgct gcgggcgttg ccggcgggtt tattgctcgt gatgatcgtc 7380
cgacagattc caacgggaat ctggtggatg cgcatcttca tcctcggcgc acttaatatt 7440
tcgctattct ggagcttgtt gtttatttcg gtctaccgcc tgccgggcgg ggtcgcggcg 7500
acggtaggcg ctgtgcagcc gctgatggtc gtgttcatct ctgccgctct gctaggtagc 7560
ccgatacgat tgatggcggt cctgggggct atttgcggaa ctgcgggcgt ggcgctgttg 7620
gtgttgacac caaacgcagc gctagatcct gtcggcgtcg cagcgggcct ggcgggggcg 7680
gtttccatgg cgttcggaac cgtgctgacc cgcaagtggc aacctcccgt gcctctgctc 7740
acctttaccg cctggcaact ggcggccgga ggacttctgc tcgttccagt agctttagtg 7800
tttgatccgc caatcccgat gcctacagga accaatgttc tcggcctggc gtggctcggc 7860
ctgatcggag cgggtttaac ctacttcctt tggttccggg ggatctcgcg actcgaacct 7920
acagttgttt ccttactggg ctttctcagc cccagatctg gggtcgatca gccggggatg 7980
catcaggccg acagtcggaa cttcgggtcc ccgacctgta ccattcggtg agcaatggat 8040
aggggagttg atatcgtcaa cgttcacttc taaagaaata gcgccactca gcttcctcag 8100
cggctttatc cagcgatttc ctattatgtc ggcatagttc tcaagatcga cagcctgtca 8160
cggttaagcg agaaatgaat aagaaggctg ataattcgga tctctgcgag ggagatgata 8220
tttgatcaca ggcagcaacg ctctgtcatc gttacaatca acatgctacc ctccgcgaga 8280 
tcatccgtgt ttcaaacccg gcagcttagt tgccgttctt ccgaatagca tcggtaacat 8340
gagcaaagtc tgccgcctta caacggctct cccgctgacg ccgtcccgga ctgatgggct 8400 
gcctgtatcg agtggtgatt ttgtgccgag ctgccggtcg gggagctgtt ggctggctgg 8460 
tggcaggata tattgtggtg taaacaaatt gacgcttaga caacttaata acacattgcg 8520
gacgttttta atgtactggg gtggtttttc ttttcaccag tgagacgggc aacagctgat 8580 
tgcccttcac cgcctggccc tgagagagtt gcagcaagcg gtccacgctg gtttgcccca 8640 
gcaggcgaaa atcctgtttg atggtggttc cgaaatcggc aaaatccctt ataaatcaaa 8700 
agaatagccc gagatagggt tgagtgttgt tccagtttgg aacaagagtc cactattaaa 8760
gaacgtggac tccaacgtca aagggcgaaa aaccgtctat cagggcgatg gcccactacg 8820
tgaaccatca cccaaatcaa gttttttggg gtcgaggtgc cgtaaagcac taaatcggaa 8880
ccctaaaggg agcccccgat ttagagcttg acggggaaag ccggcgaacg tggcgagaaa 8940
ggaagggaag aaagcgaaag gagcgggcgc cattcaggct gcgcaactgt tgggaagggc 9000 
gatcggtgcg ggcctcttcg ctattacgcc agctggcgaa agggggatgt gctgcaaggc 9060 
gattaagttg ggtaacgcca gggttttccc agtcacgacg ttgtaaaacg acggccagtg 9120
aattaattcc catcttgaaa gaaatatagt ttaaatattt attgataaaa taacaagtca 9180 
ggtattatag tccaagcaaa aacataaatt tattgatgca agtttaaatt cagaaatatt 9240 
tcaataactg attatatcag ctggtacatt gccgtagatg aaagactgag tgcgatatta 9300 
tgtgtaatac ataaattgat gatatagcta gcttagctca tcgggggatc cgtcgaagct 9360
agcttgggtc ccgctcagaa gaactcgtca agaaggcgat agaaggcgat gcgctgcgaa 9420
tcgggagcgg cgataccgta aagcacgagg aagcggtcag cccattcgcc gccaagctct 9480 
tcagcaatat cacgggtagc caacgctatg tcctgatagc ggtccgccac acccagccgg 9540
ccacagtcga tgaatccaga aaagcggcca ttttccacca tgatattcgg caagcaggca 9600
tcgccatggg tcacgacgag atcctcgccg tcgggcatgc gcgccttgag cctggcgaac 9660 
agttcggctg gcgcgagccc ctgatgctct tcgtccagat catcctgatc gacaagaccg 9720 
gcttccatcc gagtacgtgc tcgctcgatg cgatgtttcg cttggtggtc gaatgggcag 9780
gtagccggat caagcgtatg cagccgccgc attgcatcag ccatgatgga tactttctcg 9840
gcaggagcaa ggtgagatga caggagatcc tgccccggca cttcgcccaa tagcagccag 9900
tcccttcccg cttcagtgac aacgtcgagc acagctgcgc aaggaacgcc cgtcgtggcc 9960 
agccacgata gccgcgctgc ctcgtcctgc agttcattca gggcaccgga caggtcggtc 10020
ttgacaaaaa gaaccgggcg cccctgcgct gacagccgga acacggcggc atcagagcag 10080 
ccgattgtct gttgtgccca gtcatagccg aatagcctct ccacccaagc ggccggagaa 10140 
cctgcgtgca atccatcttg ttcaatccaa gctcccatgg gccctcgact agagtcgaga 10200
tctggattga gagtgaatat gagactctaa ttggataccg aggggaattt atggaacgtc 10260 
agtggagcat ttttgacaag aaatatttgc tagctgatag tgaccttagg cgacttttga 10320 
acgcgcaata atggtttctg acgtatgtgc ttagctcatt aaactccaga aacccgcggc 10380 
tgagtggctc cttcaacgtt gcggttctgt cagttccaaa cgtaaaacgg cttgtcccgc 10440 
gtcatcggcg ggggtcataa cgtgactccc ttaattctcc gctcatgatc ttgatcccct 10500 
gcgccatcag atccttggcg gcaagaaagc catccagttt actttgcagg gcttcccaac 10560
cttaccagag ggcgccccag ctggcaattc cggttcgctt gctgtccata aaaccgccca 10620
gtctagctat cgccatgtaa gcccactgca agctacctgc tttctctttg cgcttgcgtt 10680 
ttcccttgtc cagatagccc agtagctgac attcatccgg ggtcagcacc gtttctgcgg 10740 
actggctttc tacgtgttcc gcttccttta gcagcccttg cgccctgagt gcttgcggca 10800 
gcgtgaagct tgcatgcctg caggtcgacg gcgcgccgag ctcctcgagc aaatttacac 10860 
attgccacta aacgtctaaa cccttgtaat ttgtttttgt tttactatgt gtgttatgta 10920 
tttgatttgc gataaatttt tatatttggt actaaattta taacaccttt tatgctaacg 10980 
tttgccaaca cttagcaatt tgcaagttga ttaattgatt ctaaattatt tttgtcttct 11040 
aaatacatat actaatcaac tggaaatgta aatatttgct aatatttcta ctataggaga 11100 
attaaagtga gtgaatatgg taccacaagg tttggagatt taattgttgc aatgctgcat 11160 
ggatggcata tacaccaaac attcaataat tcttgaggat aataatggta ccacacaaga 11220
tttgaggtgc atgaacgtca cgtggacaaa aggtttagta atttttcaag acaacaatgt 11280
taccacacac aagttttgag gtgcatgcat ggatgccctg tggaaagttt aaaaatattt 11340
tggaaatgat ttgcatggaa gccatgtgta aaaccatgac atccacttgg aggatgcaat 11400 
aatgaagaaa actacaaatt tacatgcaac tagttatgca tgtagtctat ataatgagga 11460 
ttttgcaata ctttcattca tacacactca ctaagtttta cacgattata atttcttcat 11520 
agccagccca ccgcggtggg cggccgcctg cagtctagaa ggcctcctgc tttaatgaga 11580 
tatgcgagac gcctatgatc gcatgatatt tgctttcaat tctgttgtgc acgttgtaaa 11640 
aaacctgagc atgtgtagct cagatcctta ccgccggttt cggttcattc taatgaatat 11700 
atcacccgtt actatcgtat ttttatgaat aatattctcc gttcaattta ctgattgtcc 11760
gtcgagcaaa tttacacatt gccactaaac gtctaaaccc ttgtaatttg tttttgtttt 11820
actatgtgtg ttatgtattt gatttgcgat aaatttttat atttggtact aaatttataa 11880
caccttttat gctaacgttt gccaacactt agcaatttgc aagttgatta attgattcta 11940
aattattttt gtcttctaaa tacatatact aatcaactgg aaatgtaaat atttgctaat 12000
atttctacta taggagaatt aaagtgagtg aatatggtac cacaaggttt ggagatttaa 12060
ttgttgcaat gctgcatgga tggcatatac accaaacatt caataattct tgaggataat 12120
aatggtacca cacaagattt gaggtgcatg aacgtcacgt ggacaaaagg tttagtaatt 12180
tttcaagaca acaatgttac cacacacaag ttttgaggtg catgcatgga tgccctgtgg 12240
aaagtttaaa aatattttgg aaatgatttg catggaagcc atgtgtaaaa ccatgacatc 12300
cacttggagg atgcaataat gaagaaaact acaaatttac atgcaactag ttatgcatgt 12360
agtctatata atgaggattt tgcaatactt tcattcatac acactcacta agttttacac 12420
gattataatt tcttcatagc cagcggatcc gatatcgggc ccgctagcgt taaccctgct 12480
ttaatgagat atgcgagacg cctatgatcg catgatattt gctttcaatt ctgttgtgca 12540
cgttgtaaaa aacctgagca tgtgtagctc agatccttac cgccggtttc ggttcattct 12600
aatgaatata tcacccgtta ctatcgtatt tttatgaata atattctccg ttcaatttac 12660
tgattgtccg tcgagcaaat ttacacattg ccactaaacg tctaaaccct tgtaatttgt 12720
ttttgtttta ctatgtgtgt tatgtatttg atttgcgata aatttttata tttggtacta 12780
aatttataac accttttatg ctaacgtttg ccaacactta gcaatttgca agttgattaa 12840
ttgattctaa attatttttg tcttctaaat acatatacta atcaactgga aatgtaaata 12900
tttgctaata tttctactat aggagaatta aagtgagtga atatggtacc acaaggtttg 12960



0093/00064 CA 02485060 2004-10-28

158 

gagatttaat tgttgcaatg ctgcatggat ggcatataca ccaaacattc aataattctt 13020 
gaggataata atggtaccac acaagatttg aggtgcatga acgtcacgtg gacaaaaggt 13080 
ttagtaattt ttcaagacaa caatgttacc acacacaagt tttgaggtgc atgcatggat 13140 
gccctgtgga aagtttaaaa atattttgga aatgatttgc atggaagcca tgtgtaaaac 13200 
catgacatcc acttggagga tgcaataatg aagaaaacta caaatttaca tgcaactagt 13260 
tatgcatgta gtctatataa tgaggatttt gcaatacttt cattcataca cactcactaa 13320 
gttttacacg attataattt cttcatagcc agcagatctg ccggcatcga tcccgggcca 13380
tggcctgctt taatgagata tgcgagacgc ctatgatcgc atgatatttg ctttcaattc 13440
tgttgtgcac gttgtaaaaa acctgagcat gtgtagctca gatccttacc gccggtttcg 13500
gttcattcta atgaatatat cacccgttac tatcgtattt ttatgaataa tattctccgt 13560 
tcaatttact gattgtccgt cgacgagctc ggcgcgcctc tagaggatcg atgaattcag 13620 
atcggctgag tggctccttc aacgttgcgg ttctgtcagt tccaaacgta aaacggcttg 13680 
tcccgcgtca tcggcggggg tcataacgtg actcccttaa ttctccgctc atgatcagat 13740 
tgtcgtttcc cgccttcagt ttaaactatc agtgtttgac aggatatatt ggcgggtaaa 13800 
cctaagagaa aagagcgttt attagaataa tcggatattt aaaagggcgt gaaaaggttt 13860 
atccttcgtc catttgtatg tgcatgccaa ccacagggtt cccca 13905 

<210> 43 
<211> 15430 
<212> DNA 
<213> Unknown 

<220> 

<223> Plant expression vector with two promoter-terminator expression cassettes,
inserted is Physcomitrella patens elongase and desaturase



<220> 
<221> CDS
<222> (11543)..(12415)

<220>
<221> CDS
<222> (13313)..(14890)
<400> 43 

gatctggcgc cggccagcga gacgagcaag attggccgcc gcccgaaacg atccgacagc 60 
gcgcccagca caggtgcgca ggcaaattgc accaacgcat acagcgccag cagaatgcca 120 
tagtgggcgg tgacgtcgtt cgagtgaacc agatcgcgca ggaggcccgg cagcaccggc 180 
ataatcaggc cgatgccgac agcgtcgagc gcgacagtgc tcagaattac gatcaggggt 240 
atgttgggtt tcacgtctgg cctccggacc agcctccgct ggtccgattg aacgcgcgga 300 
ttctttatca ctgataagtt ggtggacata ttatgtttat cagtgataaa gtgtcaagca 360
tgacaaagtt gcagccgaat acagtgatcc gtgccgccct ggacctgttg aacgaggtcg 420 
gcgtagacgg tctgacgaca cgcaaactgg cggaacggtt gggggttcag cagccggcgc 480 
tttactggca cttcaggaac aagcgggcgc tgctcgacgc actggccgaa gccatgctgg 540 
cggagaatca tacgcattcg gtgccgagag ccgacgacga ctggcgctca tttctgatcg 600 
ggaatgcccg cagcttcagg caggcgctgc tcgcctaccg cgatggcgcg cgcatccatg 660 
ccggcacgcg accgggcgca ccgcagatgg aaacggccga cgcgcagctt cgcttcctct 720 
gcgaggcggg tttttcggcc ggggacgccg tcaatgcgct gatgacaatc agctacttca 780 
ctgttggggc cgtgcttgag gagcaggccg gcgacagcga tgccggcgag cgcggcggca 840
ccgttgaaca ggctccgctc tcgccgctgt tgcgggccgc gatagacgcc ttcgacgaag 900
ccggtccgga cgcagcgttc gagcagggac tcgcggtgat tgtcgatgga ttggcgaaaa 960 
ggaggctcgt tgtcaggaac gttgaaggac cgagaaaggg tgacgattga tcaggaccgc 1020 
tgccggagcg caacccactc actacagcag agccatgtag acaacatccc ctcccccttt 1080
ccaccgcgtc agacgcccgt agcagcccgc tacgggcttt ttcatgccct gccctagcgt 1140 
ccaagcctca cggccgcgct cggcctctct ggcggccttc tggcgctctt ccgcttcctc 1200 
gctcactgac tcgctgcgct cggtcgttcg gctgcggcga gcggtatcag ctcactcaaa 1260 
ggcggtaata cggttatcca cagaatcagg ggataacgca ggaaagaaca tgtgagcaaa 1320 
aggccagcaa aaggccagga accgtaaaaa ggccgcgttg ctggcgtttt tccataggct 1380
ccgcccccct gacgagcatc acaaaaatcg acgctcaagt cagaggtggc gaaacccgac 1440 
aggactataa agataccagg cgtttccccc tggaagctcc ctcgtgcgct ctcctgttcc 1500 
gaccctgccg cttaccggat acctgtccgc ctttctccct tcgggaagcg tggcgctttt 1560
ccgctgcata accctgcttc ggggtcatta tagcgatttt ttcggtatat ccatcctttt 1620 
tcgcacgata tacaggattt tgccaaaggg ttcgtgtaga ctttccttgg tgtatccaac 1680 
ggcgtcagcc gggcaggata ggtgaagtag gcccacccgc gagcgggtgt tccttcttca 1740 
ctgtccctta ttcgcacctg gcggtgctca acgggaatcc tgctctgcga ggctggccgg 1800 
ctaccgccgg cgtaacagat gagggcaagc ggatggctga tgaaaccaag ccaaccagga 1860 
agggcagccc acctatcaag gtgtactgcc ttccagacga acgaagagcg attgaggaaa 1920 
aggcggcggc ggccggcatg agcctgtcgg cctacctgct ggccgtcggc cagggctaca 1980 
aaatcacggg cgtcgtggac tatgagcacg tccgcgagct ggcccgcatc aatggcgacc 2040
tgggccgcct gggcggcctg ctgaaactct ggctcaccga cgacccgcgc acggcgcggt 2100
tcggtgatgc cacgatcctc gccctgctgg cgaagatcga agagaagcag gacgagcttg 2160 
gcaaggtcat gatgggcgtg gtccgcccga gggcagagcc atgacttttt tagccgctaa 2220 
aacggccggg gggtgcgcgt gattgccaag cacgtcccca tgcgctccat caagaagagc 2280 
gacttcgcgg agctggtgaa gtacatcacc gacgagcaag gcaagaccga gcgcctttgc 2340 
gacgctcacc gggctggttg ccctcgccgc tgggctggcg gccgtctatg gccctgcaaa 2400 
cgcgccagaa acgccgtcga agccgtgtgc gagacaccgc ggccgccggc gttgtggata 2460 
cctcgcggaa aacttggccc tcactgacag atgaggggcg gacgttgaca cttgaggggc 2520
cgactcaccc ggcgcggcgt tgacagatga ggggcaggct cgatttcggc cggcgacgtg 2580 
gagctggcca gcctcgcaaa tcggcgaaaa cgcctgattt tacgcgagtt tcccacagat 2640 
gatgtggaca agcctgggga taagtgccct gcggtattga cacttgaggg gcgcgactac 2700 
tgacagatga ggggcgcgat ccttgacact tgaggggcag agtgctgaca gatgaggggc 2760
gcacctattg acatttgagg ggctgtccac aggcagaaaa tccagcattt gcaagggttt 2820 
ccgcccgttt ttcggccacc gctaacctgt cttttaacct gcttttaaac caatatttat 2880 
aaaccttgtt tttaaccagg gctgcgccct gtgcgcgtga ccgcgcacgc cgaagggggg 2940 
tgccccccct tctcgaaccc tcccggcccg ctaacgcggg cctcccatcc ccccaggggc 3000 
tgcgcccctc ggccgcgaac ggcctcaccc caaaaatggc agcgctggca gtccttgcca 3060 
ttgccgggat cggggcagta acgggatggg cgatcagccc gagcgcgacg cccggaagca 3120 
ttgacgtgcc gcaggtgctg gcatcgacat tcagcgacca ggtgccgggc agtgagggcg 3180 
gcggcctggg tggcggcctg cccttcactt cggccgtcgg ggcattcacg gacttcatgg 3240
cggggccggc aatttttacc ttgggcattc ttggcatagt ggtcgcgggt gccgtgctcg 3300
tgttcggggg tgcgataaac ccagcgaacc atttgaggtg ataggtaaga ttataccgag 3360 
gtatgaaaac gagaattgga cctttacaga attactctat gaagcgccat atttaaaaag 3420 
ctaccaagac gaagaggatg aagaggatga ggaggcagat tgccttgaat atattgacaa 3480 
tactgataag ataatatatc ttttatatag aagatatcgc cgtatgtaag gatttcaggg 3540 
ggcaaggcat aggcagcgcg cttatcaata tatctataga atgggcaaag cataaaaact 3600 
tgcatggact aatgcttgaa acccaggaca ataaccttat agcttgtaaa ttctatcata 3660
attgggtaat gactccaact tattgatagt gttttatgtt cagataatgc ccgatgactt 3720 
tgtcatgcag ctccaccgat tttgagaacg acagcgactt ccgtcccagc cgtgccaggt 3780 
gctgcctcag attcaggtta tgccgctcaa ttcgctgcgt atatcgcttg ctgattacgt 3840
gcagctttcc cttcaggcgg gattcataca gcggccagcc atccgtcatc catatcacca 3900 
cgtcaaaggg tgacagcagg ctcataagac gccccagcgt cgccatagtg cgttcaccga 3960 
atacgtgcgc aacaaccgtc ttccggagac tgtcatacgc gtaaaacagc cagcgctggc 4020 
gcgatttagc cccgacatag ccccactgtt cgtccatttc cgcgcagacg atgacgtcac 4080 
tgcccggctg tatgcgcgag gttaccgact gcggcctgag ttttttaagt gacgtaaaat 4140 
cgtgttgagg ccaacgccca taatgcgggc tgttgcccgg catccaacgc cattcatggc 4200 
catatcaatg attttctggt gcgtaccggg ttgagaagcg gtgtaagtga actgcagttg 4260 
ccatgtttta cggcagtgag agcagagata gcgctgatgt ccggcggtgc ttttgccgtt 4320 
acgcaccacc ccgtcagtag ctgaacagga gggacagctg atagacacag aagccactgg 4380 
agcacctcaa aaacaccatc atacactaaa tcagtaagtt ggcagcatca cccataattg 4440
tggtttcaaa atcggctccg tcgatactat gttatacgcc aactttgaaa acaactttga 4500
aaaagctgtt ttctggtatt taaggtttta gaatgcaagg aacagtgaat tggagttcgt 4560
cttgttataa ttagcttctt ggggtatctt taaatactgt agaaaagagg aaggaaataa 4620
taaatggcta aaatgagaat atcaccggaa ttgaaaaaac tgatcgaaaa ataccgctgc 4680
gtaaaagata cggaaggaat gtctcctgct aaggtatata agctggtggg agaaaatgaa 4740
aacctatatt taaaaatgac ggacagccgg tataaaggga ccacctatga tgtggaacgg 4800
gaaaaggaca tgatgctatg gctggaagga aagctgcctg ttccaaaggt cctgcacttt 4860
gaacggcatg atggctggag caatctgctc atgagtgagg ccgatggcgt cctttgctcg 4920
gaagagtatg aagatgaaca aagccctgaa aagattatcg agctgtatgc ggagtgcatc 4980
aggctctttc actccatcga catatcggat tgtccctata cgaatagctt agacagccgc 5040
ttagccgaat tggattactt actgaataac gatctggccg atgtggattg cgaaaactgg 5100
gaagaagaca ctccatttaa agatccgcgc gagctgtatg attttttaaa gacggaaaag 5160
cccgaagagg aacttgtctt ttcccacggc gacctgggag acagcaacat ctttgtgaaa 5220
gatggcaaag taagtggctt tattgatctt gggagaagcg gcagggcgga caagtggtat 5280
gacattgcct tctgcgtccg gtcgatcagg gaggatatcg gggaagaaca gtatgtcgag 5340
ctattttttg acttactggg gatcaagcct gattgggaga aaataaaata ttatatttta 5400
ctggatgaat tgttttagta cctagatgtg gcgcaacgat gccggcgaca agcaggagcg 5460
caccgacttc ttccgcatca agtgttttgg ctctcaggcc gaggcccacg gcaagtattt 5520
gggcaagggg tcgctggtat tcgtgcaggg caagattcgg aataccaagt acgagaagga 5580
cggccagacg gtctacggga ccgacttcat tgccgataag gtggattatc tggacaccaa 5640
ggcaccaggc gggtcaaatc aggaataagg gcacattgcc ccggcgtgag tcggggcaat 5700
cccgcaagga gggtgaatga atcggacgtt tgaccggaag gcatacaggc aagaactgat 5760 
cgacgcgggg ttttccgccg aggatgccga aaccatcgca agccgcaccg tcatgcgtgc 5820 
gccccgcgaa accttccagt ccgtcggctc gatggtccag caagctacgg ccaagatcga 5880 
gcgcgacagc gtgcaactgg ctccccctgc cctgcccgcg ccatcggccg ccgtggagcg 5940 
ttcgcgtcgt ctcgaacagg aggcggcagg tttggcgaag tcgatgacca tcgacacgcg 6000 
aggaactatg acgaccaaga agcgaaaaac cgccggcgag gacctggcaa aaaaggtcag 6060 
cgaggccaag caggccgcgt tgctgaaaca cacgaagcag cagatcaagg aaatgcagct 6120
ttccttgttc gatattgcgc cgtggccgga cacgatgcga gcgatgccaa acgacacggc 6180 
ccgctctgcc ctgttcacca cgcgcaacaa gaaaatcccg cgcgaggcgc tgcaaaacaa 6240 
ggtcattttc cacgtcaaca aggacgtgaa gatcacctac accggcgtcg agctgcgggc 6300 
cgacgatgac gaactggtgt ggcagcaggt gttggagtac gcgaagcgca cccctatcgg 6360
cgagccgatc accttcacgt tctacgagct ttgccaggac ctgggctggt cgatcaatgg 6420 
ccggtattac acgaaggccg aggaatgcct gtcgcgccta caggcgacgg cgatgggctt 6480
cacgtccgac cgcgttgggc acctggaatc ggtgtcgctg ctgcaccgct tccgcgtcct 6540 
ggaccgtggc aagaaaacgt cccgttgcca ggtcctgatc gacgaggaaa tcgtcgtgct 6600 
gtttgctggc gaccactaca cgaaattcat atgggagaag taccgcaagc tgtcgccgac 6660 
ggcccgacgg atgttcgact atttcagctc gcaccgggag ccgtacccgc tcaagctgga 6720
aaccttccgc ctcatgtgcg gatcggattc cacccgcgtg aagaagtggc gcgagcaggt 6780
cggcgaagcc tgcgaagagt tgcgaggcag cggcctggtg gaacacgcct gggtcaatga 6840
tgacctggtg cattgcaaac gctagggcct tgtggggtca gttccggctg ggggttcagc 6900
agccagcgct ttactggcat ttcaggaaca agcgggcact gctcgacgca cttgcttcgc 6960 
tcagtatcgc tcgggacgca cggcgcgctc tacgaactgc cgataaacag aggattaaaa 7020 
ttgacaattg tgattaaggc tcagattcga cggcttggag cggccgacgt gcaggatttc 7080 
cgcgagatcc gattgtcggc cctgaagaaa gctccagaga tgttcgggtc cgtttacgag 7140 
cacgaggaga aaaagcccat ggaggcgttc gctgaacggt tgcgagatgc cgtggcattc 7200
ggcgcctaca tcgacggcga gatcattggg ctgtcggtct tcaaacagga ggacggcccc 7260
aaggacgctc acaaggcgca tctgtccggc gttttcgtgg agcccgaaca gcgaggccga 7320 
ggggtcgccg gtatgctgct gcgggcgttg ccggcgggtt tattgctcgt gatgatcgtc 7380 
cgacagattc caacgggaat ctggtggatg cgcatcttca tcctcggcgc acttaatatt 7440
tcgctattct ggagcttgtt gtttatttcg gtctaccgcc tgccgggcgg ggtcgcggcg 7500 
acggtaggcg ctgtgcagcc gctgatggtc gtgttcatct ctgccgctct gctaggtagc 7560
ccgatacgat tgatggcggt cctgggggct atttgcggaa ctgcgggcgt ggcgctgttg 7620 
gtgttgacac caaacgcagc gctagatcct gtcggcgtcg cagcgggcct ggcgggggcg 7680 
gtttccatgg cgttcggaac cgtgctgacc cgcaagtggc aacctcccgt gcctctgctc 7740
acctttaccg cctggcaact ggcggccgga ggacttctgc tcgttccagt agctttagtg 7800 
tttgatccgc caatcccgat gcctacagga accaatgttc tcggcctggc gtggctcggc 7860
ctgatcggag cgggtttaac ctacttcctt tggttccggg ggatctcgcg actcgaacct 7920 
acagttgttt ccttactggg ctttctcagc cccagatctg gggtcgatca gccggggatg 7980 
catcaggccg acagtcggaa cttcgggtcc ccgacctgta ccattcggtg agcaatggat 8040
aggggagttg atatcgtcaa cgttcacttc taaagaaata gcgccactca gcttcctcag 8100
cggctttatc cagcgatttc ctattatgtc ggcatagttc tcaagatcga cagcctgtca 8160 
cggttaagcg agaaatgaat aagaaggctg ataattcgga tctctgcgag ggagatgata 8220 
tttgatcaca ggcagcaacg ctctgtcatc gttacaatca acatgctacc ctccgcgaga 8280
tcatccgtgt ttcaaacccg gcagcttagt tgccgttctt ccgaatagca tcggtaacat 8340
gagcaaagtc tgccgcctta caacggctct cccgctgacg ccgtcccgga ctgatgggct 8400 
gcctgtatcg agtggtgatt ttgtgccgag ctgccggtcg gggagctgtt ggctggctgg 8460 
tggcaggata tattgtggtg taaacaaatt gacgcttaga caacttaata acacattgcg 8520
gacgttttta atgtactggg gtggtttttc ttttcaccag tgagacgggc aacagctgat 8580 
tgcccttcac cgcctggccc tgagagagtt gcagcaagcg gtccacgctg gtttgcccca 8640 
gcaggcgaaa atcctgtttg atggtggttc cgaaatcggc aaaatccctt ataaatcaaa 8700 
agaatagccc gagatagggt tgagtgttgt tccagtttgg aacaagagtc cactattaaa 8760
gaacgtggac tccaacgtca aagggcgaaa aaccgtctat cagggcgatg gcccactacg 8820 
tgaaccatca cccaaatcaa gttttttggg gtcgaggtgc cgtaaagcac taaatcggaa 8880 
ccctaaaggg agcccccgat ttagagcttg acggggaaag ccggcgaacg tggcgagaaa 8940 
ggaagggaag aaagcgaaag gagcgggcgc cattcaggct gcgcaactgt tgggaagggc 9000 
gatcggtgcg ggcctcttcg ctattacgcc agctggcgaa agggggatgt gctgcaaggc 9060 
gattaagttg ggtaacgcca gggttttccc agtcacgacg ttgtaaaacg acggccagtg 9120 
aattaattcc catcttgaaa gaaatatagt ttaaatattt attgataaaa taacaagtca 9180 
ggtattatag tccaagcaaa aacataaatt tattgatgca agtttaaatt cagaaatatt 9240
tcaataactg attatatcag ctggtacatt gccgtagatg aaagactgag tgcgatatta 9300
tgtgtaatac ataaattgat gatatagcta gcttagctca tcgggggatc cgtcgaagct 9360
agcttgggtc ccgctcagaa gaactcgtca agaaggcgat agaaggcgat gcgctgcgaa 9420
tcgggagcgg cgataccgta aagcacgagg aagcggtcag cccattcgcc gccaagctct 9480
tcagcaatat cacgggtagc caacgctatg tcctgatagc ggtccgccac acccagccgg 9540
ccacagtcga tgaatccaga aaagcggcca ttttccacca tgatattcgg caagcaggca 9600
tcgccatggg tcacgacgag atcctcgccg tcgggcatgc gcgccttgag cctggcgaac 9660
agttcggctg gcgcgagccc ctgatgctct tcgtccagat catcctgatc gacaagaccg 9720
gcttccatcc gagtacgtgc tcgctcgatg cgatgtttcg cttggtggtc gaatgggcag 9780
gtagccggat caagcgtatg cagccgccgc attgcatcag ccatgatgga tactttctcg 9840 
gcaggagcaa ggtgagatga caggagatcc tgccccggca cttcgcccaa tagcagccag 9900 
tcccttcccg cttcagtgac aacgtcgagc acagctgcgc aaggaacgcc cgtcgtggcc 9960
agccacgata gccgcgctgc ctcgtcctgc agttcattca gggcaccgga caggtcggtc 10020 
ttgacaaaaa gaaccgggcg cccctgcgct gacagccgga acacggcggc atcagagcag 10080 
ccgattgtct gttgtgccca gtcatagccg aatagcctct ccacccaagc ggccggagaa 10140 
cctgcgtgca atccatcttg ttcaatccaa gctcccatgg gccctcgact agagtcgaga 10200 
tctggattga gagtgaatat gagactctaa ttggataccg aggggaattt atggaacgtc 10260 
agtggagcat ttttgacaag aaatatttgc tagctgatag tgaccttagg cgacttttga 10320 
acgcgcaata atggtttctg acgtatgtgc ttagctcatt aaactccaga aacccgcggc 10380 
tgagtggctc cttcaacgtt gcggttctgt cagttccaaa cgtaaaacgg cttgtcccgc 10440
gtcatcggcg ggggtcataa cgtgactccc ttaattctcc gctcatgatc ttgatcccct 10500
gcgccatcag atccttggcg gcaagaaagc catccagttt actttgcagg gcttcccaac 10560 
cttaccagag ggcgccccag ctggcaattc cggttcgctt gctgtccata aaaccgccca 10620 
gtctagctat cgccatgtaa gcccactgca agctacctgc tttctctttg cgcttgcgtt 10680 
ttcccttgtc cagatagccc agtagctgac attcatccgg ggtcagcacc gtttctgcgg 10740
actggctttc tacgtgttcc gcttccttta gcagcccttg cgccctgagt gcttgcggca 10800
gcgtgaagct tgcatgcctg caggtcgacg gcgcgccgag ctcctcgagc aaatttacac 10860
attgccacta aacgtctaaa cccttgtaat ttgtttttgt tttactatgt gtgttatgta 10920
tttgatttgc gataaatttt tatatttggt actaaattta taacaccttt tatgctaacg 10980
tttgccaaca cttagcaatt tgcaagttga ttaattgatt ctaaattatt tttgtcttct 11040
aaatacatat actaatcaac tggaaatgta aatatttgct aatatttcta ctataggaga 11100
attaaagtga gtgaatatgg taccacaagg tttggagatt taattgttgc aatgctgcat 11160
ggatggcata tacaccaaac attcaataat tcttgaggat aataatggta ccacacaaga 11220
tttgaggtgc atgaacgtca cgtggacaaa aggtttagta atttttcaag acaacaatgt 11280
taccacacac aagttttgag gtgcatgcat ggatgccctg tggaaagttt aaaaatattt 11340
tggaaatgat ttgcatggaa gccatgtgta aaaccatgac atccacttgg aggatgcaat 11400
aatgaagaaa actacaaatt tacatgcaac tagttatgca tgtagtctat ataatgagga 11460 
ttttgcaata ctttcattca tacacactca ctaagtttta cacgattata atttcttcat 11520 

agccagccca ccgcggtgga aa atg gag gtc gtg gag aga ttc tac ggt gag 11572 

Met Glu Val Val Glu Arg Phe Tyr Gly Glu 
1 5 10

ttg gat ggg aag gtc tcg cag ggc gtg aat gca ttg ctg ggt agt ttt 11620
Leu Asp Gly Lys Val Ser Gln Gly Val Asn Ala Leu Leu Gly Ser Phe
15 20 25

ggg gtg gag ttg acg gat acg ccc act acc aaa ggc ttg ccc ctc gtt 11668
Gly Val Glu Leu Thr Asp Thr Pro Thr Thr Lys Gly Leu Pro Leu Val
30 35 40

gac agt ccc aca ccc atc gtc ctc ggt gtt tct gta tac ttg act att 11716
Asp Ser Pro Thr Pro Ile Val Leu Gly Val Ser Val Tyr Leu Thr Ile
45 50 55

gtc att gga ggg ctt ttg tgg ata aag gcc agg gat ctg aaa ccg cgc 11764
Val Ile Gly Gly Leu Leu Trp Ile Lys Ala Arg Asp Leu Lys Pro Arg
60 65 70

gcc tcg gag cca ttt ttg ctc caa gct ttg gtg ctt gtg cac aac ctg 11812
Ala Ser Glu Pro Phe Leu Leu Gln Ala Leu Val Leu Val His Asn Leu
75 80 85 90

ttc tgt ttt gcg ctc agt ctg tat atg tgc gtg ggc atc gct tat cag 11860
Phe Cys Phe Ala Leu Ser Leu Tyr Met Cys Val Gly Ile Ala Tyr Gln
95 100 105

gct att acc tgg cgg tac tct ctc tgg ggc aat gca tac aat cct aaa 11908
Ala Ile Thr Trp Arg Tyr Ser Leu Trp Gly Asn Ala Tyr Asn Pro Lys
110 115 120

cat aaa gag atg gcg att ctg gta tac ttg ttc tac atg tct aag tac 11956
His Lys Glu Met Ala Ile Leu Val Tyr Leu Phe Tyr Met Ser Lys Tyr
125 130 135

gtg gaa ttc atg gat acc gtt atc atg ata ctg aag cgc agc acc agg 12004
Val Glu Phe Met Asp Thr Val Ile Met Ile Leu Lys Arg Ser Thr Arg
140 145 150

caa ata agc ttc ctc cac gtt tat cat cat tct tca att tcc ctc att 12052
Gln Ile Ser Phe Leu His Val Tyr His His Ser Ser Ile Ser Leu Ile
155 160 165 170

tgg tgg gct att gct cat cac gct cct ggc ggt gaa gca tat tgg tct 12100
Trp Trp Ala Ile Ala His His Ala Pro Gly Gly Glu Ala Tyr Trp Ser
175 180 185

gcg gct ctg aac tca gga gtg cat gtt ctc atg tat gcg tat tac ttc 12148
Ala Ala Leu Asn Ser Gly Val His Val Leu Met Tyr Ala Tyr Tyr Phe
190 195 200

ttg gct gcc tgc ctt cga agt agc cca aag tta aaa aat aag tac ctt 12196
Leu Ala Ala Cys Leu Arg Ser Ser Pro Lys Leu Lys Asn Lys Tyr Leu
205 210 215

ttt tgg ggc agg tac ttg aca caa ttc caa atg ttc cag ttt atg ctg 12244
Phe Trp Gly Arg Tyr Leu Thr Gln Phe Gln Met Phe Gln Phe Met Leu
220 225 230

aac tta gtg cag gct tac tac gac atg aaa acg aat gcg cca tat cca 12292
Asn Leu Val Gln Ala Tyr Tyr Asp Met Lys Thr Asn Ala Pro Tyr Pro
235 240 245 250

caa tgg ctg atc aag att ttg ttc tac tac atg atc tcg ttg ctg ttt 12340
Gln Trp Leu Ile Lys Ile Leu Phe Tyr Tyr Met Ile Ser Leu Leu Phe
255 260 265

ctt ttc ggc aat ttt tac gta caa aaa tac atc aaa ccc tct gac gga 12388
Leu Phe Gly Asn Phe Tyr Val Gln Lys Tyr Ile Lys Pro Ser Asp Gly
270 275 280

aag caa aag gga gct aaa act gag tga tctagaaggc ctcctgcttt 12435
Lys Gln Lys Gly Ala Lys Thr Glu
285 290
285 290 

aatgagatat gcgagacgcc tatgatcgca tgatatttgc tttcaattct gttgtgcacg 12495
ttgtaaaaaa cctgagcatg tgtagctcag atccttaccg ccggtttcgg ttcattctaa 12555
tgaatatatc acccgttact atcgtatttt tatgaataat attctccgtt caatttactg 12615
attgtccgtc gagcaaattt acacattgcc actaaacgtc taaacccttg taatttgttt 12675
ttgttttact atgtgtgtta tgtatttgat ttgcgataaa tttttatatt tggtactaaa 12735
tttataacac cttttatgct aacgtttgcc aacacttagc aatttgcaag ttgattaatt 12795 
gattctaaat tatttttgtc ttctaaatac atatactaat caactggaaa tgtaaatatt 12855 
tgctaatatt tctactatag gagaattaaa gtgagtgaat atggtaccac aaggtttgga 12915 
gatttaattg ttgcaatgct gcatggatgg catatacacc aaacattcaa taattcttga 12975 
ggataataat ggtaccacac aagatttgag gtgcatgaac gtcacgtgga caaaaggttt 13035 
agtaattttt caagacaaca atgttaccac acacaagttt tgaggtgcat gcatggatgc 13095 
cctgtggaaa gtttaaaaat attttggaaa tgatttgcat ggaagccatg tgtaaaacca 13155 
tgacatccac ttggaggatg caataatgaa gaaaactaca aatttacatg caactagtta 13215 
tgcatgtagt ctatataatg aggattttgc aatactttca ttcatacaca ctcactaagt 13275 


tttacacgat tataatttct tcatagccag cggatcc atg gta ttc gcg ggc ggt 13330 

Met Val Phe Ala Gly Gly 

295 

gga ctt cag cag ggc tct ctc gaa gaa aac atc gac gtc gag cac att 13378
Gly Leu Gln Gln Gly Ser Leu Glu Glu Asn Ile Asp Val Glu His Ile
300 305 310

gcc agt atg tct ctc ttc agc gac ttc ttc agt tat gtg tct tca act 13426
Ala Ser Met Ser Leu Phe Ser Asp Phe Phe Ser Tyr Val Ser Ser Thr
315 320 325

gtt ggt tcg tgg agc gta cac agt ata caa cct ttg aag cgc ctg acg 13474
Val Gly Ser Trp Ser Val His Ser Ile Gln Pro Leu Lys Arg Leu Thr
330 335 340 345

agt aag aag cgt gtt tcg gaa agc gct gcc gtg caa tgt ata tca gct 13522
Ser Lys Lys Arg Val Ser Glu Ser Ala Ala Val Gln Cys Ile Ser Ala
350 355 360

gaa gtt cag aga aat tcg agt acc cag gga act gcg gag gca ctc gca 13570
Glu Val Gln Arg Asn Ser Ser Thr Gln Gly Thr Ala Glu Ala Leu Ala
365 370 375

gaa tea gtc gtg aag ccc acg aga cga agg tea tct cag tgg aag aag 13618 
Glu Ser Val Val Lys Pro Thr Arg Arg Arg Ser Ser Gin Trp Lys Lys 
380 385 390 

teg aca cac ccc eta tea gaa gta gca gta cac aac aag cca age gat 13666 
Ser Thr His Pro Leu Ser Glu Val Ala Val His Asn Lys Pro Ser Asp 
395 400 405 

tgc tgg att gtt gta aaa aac aag gtg tat gat gtt tec aat ttt gcg 13714 
Cys Trp He Val Val Lys Asn Lys Val Tyr Asp Val Ser Asn Phe Ala 
410 415 420 425 

gac gag cat ccc gga gga tea gtt att agt act tat ttt gga cga gac 13762 
Asp Glu His Pro Gly Gly Ser Val He Ser Thr Tyr Phe Gly Arg Asp 

430 435 440 

ggc aca gat gtt ttc tct agt ttt cat gca get tct aca tgg aaa att 13810 
Gly Thr Asp Val Phe Ser Ser Phe His Ala Ala Ser Thr Trp Lys He 

445 ASO 455 

ctt caa gac ttt tac att ggt gac gtg gag agg gtg gag ccg act cca 13 858 
Leu Gin Asp Phe Tyr He Gly Asp Val Glu Arg Val Glu Pro Thr Pro 
460 465 470 

gag ctg ctg aaa gat ttc cga gaa atg aga get ctt ttc ctg agg gag 13906 
Glu Leu Leu Lys Asp Phe Arg Glu Met Arg Ala Leu Phe Leu Arg Glu 
475 480 485 

caa ctt ttc aaa agt teg aaa ttg tac tat gtt atg aag ctg etc acg 13954 
Gin Leu Phe Lys Ser Ser Lys Leu Tyr Tyr Val Met Lys Leu Leu Thr 
490 495 500 505 



aat gtt get att ttt get gcg age att gca ata ata tgt tgg age aag 
Asn Val Ala He Phe Ala Ala Ser He Ala He He Cys Trp Ser Lys 

510 515 520 



14002 
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act att tea gcg gtt ttg get tea get tgt atg atg get ctg tgt ttc 14050 
Thr lie Ser Ala val Leu Ala Ser Ala Cys Met Met Ala Leu Cys Phe 

525 530 535 

caa cag tgc gga tgg cta tcc cat gat ttt ctc cac aat cag gtg ttt 14098 
Gln Gln Cys Gly Trp Leu Ser His Asp Phe Leu His Asn Gln Val Phe 
540 545 550 

gag aca cgc tgg ctt aat gaa gtt gtc ggg tat gtg atc ggc aac gcc 14146 
Glu Thr Arg Trp Leu Asn Glu Val Val Gly Tyr Val Ile Gly Asn Ala 
555 560 565 

gtt ctg ggg ttt agt aca ggg tgg tgg aag gag aag cat aac ctt cat 14194 
Val Leu Gly Phe Ser Thr Gly Trp Trp Lys Glu Lys His Asn Leu His 
570 575 580 585 

cat gct gct cca aat gaa tgc gat cag act tac caa cca att gat gaa 14242 
His Ala Ala Pro Asn Glu Cys Asp Gln Thr Tyr Gln Pro Ile Asp Glu 
590 595 600 

gat att gat act ctc ccc ctc att gcc tgg agc aag gac ata ctg gcc 14290 
Asp Ile Asp Thr Leu Pro Leu Ile Ala Trp Ser Lys Asp Ile Leu Ala 
605 610 615 

aca gtt gag aat aag aca ttc ttg cga atc ctc caa tac cag cat ctg 14338 
Thr Val Glu Asn Lys Thr Phe Leu Arg Ile Leu Gln Tyr Gln His Leu 
620 625 630 

ttc ttc atg ggt ctg tta ttt ttc gcc cgt ggt agt tgg ctc ttt tgg 14386 
Phe Phe Met Gly Leu Leu Phe Phe Ala Arg Gly Ser Trp Leu Phe Trp 
635 640 645 

agc tgg aga tat acc tct aca gca gtg ctc tca cct gtc gac agg ttg 14434 
Ser Trp Arg Tyr Thr Ser Thr Ala Val Leu Ser Pro Val Asp Arg Leu 
650 655 660 665 

ttg gag aag gga act gtt ctg ttt cac tac ttt tgg ttc gtc ggg aca 14482 
Leu Glu Lys Gly Thr Val Leu Phe His Tyr Phe Trp Phe Val Gly Thr 
670 675 680 

gcg tgc tat ctt ctc cct ggt tgg aag cca tta gta tgg atg gcg gtg 14530 
Ala Cys Tyr Leu Leu Pro Gly Trp Lys Pro Leu Val Trp Met Ala Val 
685 690 695 

act gag ctc atg tcc ggc atg ctg ctg ggc ttt gta ttt gta ctt agc 14578 
Thr Glu Leu Met Ser Gly Met Leu Leu Gly Phe Val Phe Val Leu Ser 
700 705 710 



cac aat ggg atg gag gtt tat aat tcg tct aaa gaa ttc gtg agt gca 14626 
His Asn Gly Met Glu Val Tyr Asn Ser Ser Lys Glu Phe Val Ser Ala 
715 720 725 

cag atc gta tcc aca cgg gat atc aaa gga aac ata ttc aac gac tgg 14674 
Gln Ile Val Ser Thr Arg Asp Ile Lys Gly Asn Ile Phe Asn Asp Trp 
730 735 740 745 

ttc act ggt ggc ctt aac agg caa ata gag cat cat ctt ttc cca aca 14722 
Phe Thr Gly Gly Leu Asn Arg Gln Ile Glu His His Leu Phe Pro Thr 
750 755 760 

atg ccc agg cat aat tta aac aaa ata gca cct aga gtg gag gtg ttc 14770 
Met Pro Arg His Asn Leu Asn Lys Ile Ala Pro Arg Val Glu Val Phe 
765 770 775 

tgt aag aaa cac ggt ctg gtg tac gaa gac gta tct att gct acc ggc 14818 
Cys Lys Lys His Gly Leu Val Tyr Glu Asp Val Ser Ile Ala Thr Gly 
780 785 790 

act tgc aag gtt ttg aaa gca ttg aag gaa gtc gcg gag gct gcg gca 14866 
Thr Cys Lys Val Leu Lys Ala Leu Lys Glu Val Ala Glu Ala Ala Ala 
795 800 805 



gag cag cat gct acc acc agt taa gctagcgtta accctgcttt aatgagatat 14920 
Glu Gln His Ala Thr Thr Ser 
810 815 

gcgagacgcc tatgatcgca tgatatttgc tttcaattct gttgtgcacg ttgtaaaaaa 14980 
cctgagcatg tgtagctcag atccttaccg ccggtttcgg ttcattctaa tgaatatatc 15040 
acccgttact atcgtatttt tatgaataat attctccgtt caatttactg attgtccgtc 15100 

gacgaattcg agctcggcgc gcctctagag gatcgatgaa ttcagatcgg ctgagtggct 15160 

ccttcaacgt tgcggttctg tcagttccaa acgtaaaacg gcttgtcccg cgtcatcggc 15220 

gggggtcata acgtgactcc cttaattctc cgctcatgat cagattgtcg tttcccgcct 15280 

tcagtttaaa ctatcagtgt ttgacaggat atattggcgg gtaaacctaa gagaaaagag 15340 

cgtttattag aataatcgga tatttaaaag ggcgtgaaaa ggtttatcct tcgtccattt 15400 

gtatgtgcat gccaaccaca gggttcccca 15430 

<210> 44 

<211> 290 

<212> PRT 

<213> unknown 

<400> 44 

Met Glu Val Val Glu Arg Phe Tyr Gly Glu Leu Asp Gly Lys Val Ser 
1 5 10 15 

Gln Gly Val Asn Ala Leu Leu Gly Ser Phe Gly Val Glu Leu Thr Asp 
20 25 30 

Thr Pro Thr Thr Lys Gly Leu Pro Leu Val Asp Ser Pro Thr Pro Ile 
35 40 45 

Val Leu Gly Val Ser Val Tyr Leu Thr Ile Val Ile Gly Gly Leu Leu 
50 55 60 

Trp Ile Lys Ala Arg Asp Leu Lys Pro Arg Ala Ser Glu Pro Phe Leu 
65 70 75 80 

Leu Gln Ala Leu Val Leu Val His Asn Leu Phe Cys Phe Ala Leu Ser 
85 90 95 

Leu Tyr Met Cys Val Gly Ile Ala Tyr Gln Ala Ile Thr Trp Arg Tyr 
100 105 110 

Ser Leu Trp Gly Asn Ala Tyr Asn Pro Lys His Lys Glu Met Ala Ile 
115 120 125 

Leu Val Tyr Leu Phe Tyr Met Ser Lys Tyr Val Glu Phe Met Asp Thr 
130 135 140 

Val Ile Met Ile Leu Lys Arg Ser Thr Arg Gln Ile Ser Phe Leu His 
145 150 155 160 

Val Tyr His His Ser Ser Ile Ser Leu Ile Trp Trp Ala Ile Ala His 
165 170 175 

His Ala Pro Gly Gly Glu Ala Tyr Trp Ser Ala Ala Leu Asn Ser Gly 
180 185 190 

Val His Val Leu Met Tyr Ala Tyr Tyr Phe Leu Ala Ala Cys Leu Arg 
195 200 205 

Ser Ser Pro Lys Leu Lys Asn Lys Tyr Leu Phe Trp Gly Arg Tyr Leu 
210 215 220 

Thr Gln Phe Gln Met Phe Gln Phe Met Leu Asn Leu Val Gln Ala Tyr 
225 230 235 240 

Tyr Asp Met Lys Thr Asn Ala Pro Tyr Pro Gln Trp Leu Ile Lys Ile 
245 250 255 

Leu Phe Tyr Tyr Met Ile Ser Leu Leu Phe Leu Phe Gly Asn Phe Tyr 
260 265 270 

Val Gln Lys Tyr Ile Lys Pro Ser Asp Gly Lys Gln Lys Gly Ala Lys 
275 280 285 

Thr Glu 
290 

<210> 45 
<211> 525 
<212> PRT 
<213> Unknown 

<400> 45 

Met Val Phe Ala Gly Gly Gly Leu Gln Gln Gly Ser Leu Glu Glu Asn 
1 5 10 15 

Ile Asp Val Glu His Ile Ala Ser Met Ser Leu Phe Ser Asp Phe Phe 
20 25 30 

Ser Tyr Val Ser Ser Thr Val Gly Ser Trp Ser Val His Ser Ile Gln 
35 40 45 

Pro Leu Lys Arg Leu Thr Ser Lys Lys Arg Val Ser Glu Ser Ala Ala 
50 55 60 

Val Gln Cys Ile Ser Ala Glu Val Gln Arg Asn Ser Ser Thr Gln Gly 
65 70 75 80 

Thr Ala Glu Ala Leu Ala Glu Ser Val Val Lys Pro Thr Arg Arg Arg 
85 90 95 

Ser Ser Gln Trp Lys Lys Ser Thr His Pro Leu Ser Glu Val Ala Val 
100 105 110 

His Asn Lys Pro Ser Asp Cys Trp Ile Val Val Lys Asn Lys Val Tyr 
115 120 125 

Asp Val Ser Asn Phe Ala Asp Glu His Pro Gly Gly Ser Val Ile Ser 
130 135 140 

Thr Tyr Phe Gly Arg Asp Gly Thr Asp Val Phe Ser Ser Phe His Ala 
145 150 155 160 

Ala Ser Thr Trp Lys Ile Leu Gln Asp Phe Tyr Ile Gly Asp Val Glu 
165 170 175 



Arg Val Glu Pro Thr Pro Glu Leu Leu Lys Asp Phe Arg Glu Met Arg 
180 185 190 

Ala Leu Phe Leu Arg Glu Gln Leu Phe Lys Ser Ser Lys Leu Tyr Tyr 
195 200 205 

Val Met Lys Leu Leu Thr Asn Val Ala Ile Phe Ala Ala Ser Ile Ala 
210 215 220 

Ile Ile Cys Trp Ser Lys Thr Ile Ser Ala Val Leu Ala Ser Ala Cys 
225 230 235 240 

Met Met Ala Leu Cys Phe Gln Gln Cys Gly Trp Leu Ser His Asp Phe 
245 250 255 

Leu His Asn Gln Val Phe Glu Thr Arg Trp Leu Asn Glu Val Val Gly 
260 265 270 

Tyr Val Ile Gly Asn Ala Val Leu Gly Phe Ser Thr Gly Trp Trp Lys 
275 280 285 

Glu Lys His Asn Leu His His Ala Ala Pro Asn Glu Cys Asp Gln Thr 
290 295 300 

Tyr Gln Pro Ile Asp Glu Asp Ile Asp Thr Leu Pro Leu Ile Ala Trp 
305 310 315 320 

Ser Lys Asp Ile Leu Ala Thr Val Glu Asn Lys Thr Phe Leu Arg Ile 
325 330 335 

Leu Gln Tyr Gln His Leu Phe Phe Met Gly Leu Leu Phe Phe Ala Arg 
340 345 350 

Gly Ser Trp Leu Phe Trp Ser Trp Arg Tyr Thr Ser Thr Ala Val Leu 
355 360 365 

Ser Pro Val Asp Arg Leu Leu Glu Lys Gly Thr Val Leu Phe His Tyr 
370 375 380 

Phe Trp Phe Val Gly Thr Ala Cys Tyr Leu Leu Pro Gly Trp Lys Pro 
385 390 395 400 

Leu Val Trp Met Ala Val Thr Glu Leu Met Ser Gly Met Leu Leu Gly 
405 410 415 

Phe Val Phe Val Leu Ser His Asn Gly Met Glu Val Tyr Asn Ser Ser 
420 425 430 

Lys Glu Phe Val Ser Ala Gln Ile Val Ser Thr Arg Asp Ile Lys Gly 
435 440 445 

Asn Ile Phe Asn Asp Trp Phe Thr Gly Gly Leu Asn Arg Gln Ile Glu 
450 455 460 

His His Leu Phe Pro Thr Met Pro Arg His Asn Leu Asn Lys Ile Ala 
465 470 475 480 

Pro Arg Val Glu Val Phe Cys Lys Lys His Gly Leu Val Tyr Glu Asp 
485 490 495 

Val Ser Ile Ala Thr Gly Thr Cys Lys Val Leu Lys Ala Leu Lys Glu 
500 505 510 

Val Ala Glu Ala Ala Ala Glu Gln His Ala Thr Thr Ser 
515 520 525 



<210> 46 
<211> 17752 
<212> DNA 
<213> Unknown 

<220> 

<223> Plant expression vector with 3 

promoter-terminator expression cassettes, 
inserted with Physcomitrella elongase + desaturase 
+ Phaeodactylum desaturase 

<220> 
<221> CDS 
<222> (11543)..(12415) 

<220> 
<221> CDS 
<222> (13313)..(14890) 

<220> 
<221> CDS 
<222> (15791)..(17200) 

<400> 46 

gatctggcgc cggccagcga gacgagcaag attggccgcc gcccgaaacg atccgacagc 60 
gcgcccagca caggtgcgca ggcaaattgc accaacgcat acagcgccag cagaatgcca 120 
tagtgggcgg tgacgtcgtt cgagtgaacc agatcgcgca ggaggcccgg cagcaccggc 180 
ataatcaggc cgatgccgac agcgtcgagc gcgacagtgc tcagaattac gatcaggggt 240 
atgttgggtt tcacgtctgg cctccggacc agcctccgct ggtccgattg aacgcgcgga 300 
ttctttatca ctgataagtt ggtggacata ttatgtttat cagtgataaa gtgtcaagca 360 
tgacaaagtt gcagccgaat acagtgatcc gtgccgccct ggacctgttg aacgaggtcg 420 
gcgtagacgg tctgacgaca cgcaaactgg cggaacggtt gggggttcag cagccggcgc 480 
tttactggca cttcaggaac aagcgggcgc tgctcgacgc actggccgaa gccatgctgg 540 
cggagaatca tacgcattcg gtgccgagag ccgacgacga ctggcgctca tttctgatcg 600 
ggaatgcccg cagcttcagg caggcgctgc tcgcctaccg cgatggcgcg cgcatccatg 660 
ccggcacgcg accgggcgca ccgcagatgg aaacggccga cgcgcagctt cgcttcctct 720 
gcgaggcggg tttttcggcc ggggacgccg tcaatgcgct gatgacaatc agctacttca 780 
ctgttggggc cgtgcttgag gagcaggccg gcgacagcga tgccggcgag cgcggcggca 840 
ccgttgaaca ggctccgctc tcgccgctgt tgcgggccgc gatagacgcc ttcgacgaag 900 
ccggtccgga cgcagcgttc gagcagggac tcgcggtgat tgtcgatgga ttggcgaaaa 960 
ggaggctcgt tgtcaggaac gttgaaggac cgagaaaggg tgacgattga tcaggaccgc 1020 
tgccggagcg caacccactc actacagcag agccatgtag acaacatccc ctcccccttt 1080 
ccaccgcgtc agacgcccgt agcagcccgc tacgggcttt ttcatgccct gccctagcgt 1140 
ccaagcctca cggccgcgct cggcctctct ggcggccttc tggcgctctt ccgcttcctc 1200 
gctcactgac tcgctgcgct cggtcgttcg gctgcggcga gcggtatcag ctcactcaaa 1260 
ggcggtaata cggttatcca cagaatcagg ggataacgca ggaaagaaca tgtgagcaaa 1320 
aggccagcaa aaggccagga accgtaaaaa ggccgcgttg ctggcgtttt tccataggct 1380 
ccgcccccct gacgagcatc acaaaaatcg acgctcaagt cagaggtggc gaaacccgac 1440 
aggactataa agataccagg cgtttccccc tggaagctcc ctcgtgcgct ctcctgttcc 1500 
gaccctgccg cttaccggat acctgtccgc ctttctccct tcgggaagcg tggcgctttt 1560 
ccgctgcata accctgcttc ggggtcatta tagcgatttt ttcggtatat ccatcctttt 1620 
tcgcacgata tacaggattt tgccaaaggg ttcgtgtaga ctttccttgg tgtatccaac 1680 
ggcgtcagcc gggcaggata ggtgaagtag gcccacccgc gagcgggtgt tccttcttca 1740 
ctgtccctta ttcgcacctg gcggtgctca acgggaatcc tgctctgcga ggctggccgg 1800 
ctaccgccgg cgtaacagat gagggcaagc ggatggctga tgaaaccaag ccaaccagga 1860 
agggcagccc acctatcaag gtgtactgcc ttccagacga acgaagagcg attgaggaaa 1920 
aggcggcggc ggccggcatg agcctgtcgg cctacctgct ggccgtcggc cagggctaca 1980 
aaatcacggg cgtcgtggac tatgagcacg tccgcgagct ggcccgcatc aatggcgacc 2040 
tgggccgcct gggcggcctg ctgaaactct ggctcaccga cgacccgcgc acggcgcggt 2100 
tcggtgatgc cacgatcctc gccctgctgg cgaagatcga agagaagcag gacgagcttg 2160 
gcaaggtcat gatgggcgtg gtccgcccga gggcagagcc atgacttttt tagccgctaa 2220 
aacggccggg gggtgcgcgt gattgccaag cacgtcccca tgcgctccat caagaagagc 2280 
gacttcgcgg agctggtgaa gtacatcacc gacgagcaag gcaagaccga gcgcctttgc 2340 
gacgctcacc gggctggttg ccctcgccgc tgggctggcg gccgtctatg gccctgcaaa 2400 
cgcgccagaa acgccgtcga agccgtgtgc gagacaccgc ggccgccggc gttgtggata 2460 
cctcgcggaa aacttggccc tcactgacag atgaggggcg gacgttgaca cttgaggggc 2520 
cgactcaccc ggcgcggcgt tgacagatga ggggcaggct cgatttcggc cggcgacgtg 2580 
gagctggcca gcctcgcaaa tcggcgaaaa cgcctgattt tacgcgagtt tcccacagat 2640 
gatgtggaca agcctgggga taagtgccct gcggtattga cacttgaggg gcgcgactac 2700 
tgacagatga ggggcgcgat ccttgacact tgaggggcag agtgctgaca gatgaggggc 2760 
gcacctattg acatttgagg ggctgtccac aggcagaaaa tccagcattt gcaagggttt 2820 
ccgcccgttt ttcggccacc gctaacctgt cttttaacct gcttttaaac caatatttat 2880 
aaaccttgtt tttaaccagg gctgcgccct gtgcgcgtga ccgcgcacgc cgaagggggg 2940 
tgccccccct tctcgaaccc tcccggcccg ctaacgcggg cctcccatcc ccccaggggc 3000 
tgcgcccctc ggccgcgaac ggcctcaccc caaaaatggc agcgctggca gtccttgcca 3060 
ttgccgggat cggggcagta acgggatggg cgatcagccc gagcgcgacg cccggaagca 3120 
ttgacgtgcc gcaggtgctg gcatcgacat tcagcgacca ggtgccgggc agtgagggcg 3180 
gcggcctggg tggcggcctg cccttcactt cggccgtcgg ggcattcacg gacttcatgg 3240 
cggggccggc aatttttacc ttgggcattc ttggcatagt ggtcgcgggt gccgtgctcg 3300 
tgttcggggg tgcgataaac ccagcgaacc atttgaggtg ataggtaaga ttataccgag 3360 
gtatgaaaac gagaattgga cctttacaga attactctat gaagcgccat atttaaaaag 3420 
ctaccaagac gaagaggatg aagaggatga ggaggcagat tgccttgaat atattgacaa 3480 
tactgataag ataatatatc ttttatatag aagatatcgc cgtatgtaag gatttcaggg 3540 
ggcaaggcat aggcagcgcg cttatcaata tatctataga atgggcaaag cataaaaact 3600 
tgcatggact aatgcttgaa acccaggaca ataaccttat agcttgtaaa ttctatcata 3660 
attgggtaat gactccaact tattgatagt gttttatgtt cagataatgc ccgatgactt 3720 
tgtcatgcag ctccaccgat tttgagaacg acagcgactt ccgtcccagc cgtgccaggt 3780 
gctgcctcag attcaggtta tgccgctcaa ttcgctgcgt atatcgcttg ctgattacgt 3840 
gcagctttcc cttcaggcgg gattcataca gcggccagcc atccgtcatc catatcacca 3900 
cgtcaaaggg tgacagcagg ctcataagac gccccagcgt cgccatagtg cgttcaccga 3960 
atacgtgcgc aacaaccgtc ttccggagac tgtcatacgc gtaaaacagc cagcgctggc 4020 
gcgatttagc cccgacatag ccccactgtt cgtccatttc cgcgcagacg atgacgtcac 4080 
tgcccggctg tatgcgcgag gttaccgact gcggcctgag ttttttaagt gacgtaaaat 4140 
cgtgttgagg ccaacgccca taatgcgggc tgttgcccgg catccaacgc cattcatggc 4200 
catatcaatg attttctggt gcgtaccggg ttgagaagcg gtgtaagtga actgcagttg 4260 
ccatgtttta cggcagtgag agcagagata gcgctgatgt ccggcggtgc ttttgccgtt 4320 
acgcaccacc ccgtcagtag ctgaacagga gggacagctg atagacacag aagccactgg 4380 
agcacctcaa aaacaccatc atacactaaa tcagtaagtt ggcagcatca cccaraattg 4440 
tggtttcaaa atcggctccg tcgatactat gttatacgcc aactttgaaa acaactttga 4500 
aaaagctgtt ttctggtatt taaggtttta gaatgcaagg aacagtgaat tggagttcgt 4560 
cttgttataa ttagcttctt ggggtatctt taaatactgt agaaaagagg aaggaaataa 4620 
taaatggcta aaatgagaat atcaccggaa ttgaaaaaac tgatcgaaaa ataccgctgc 4680 
gtaaaagata cggaaggaat gtctcctgct aaggtatata agctggtggg agaaaatgaa 4740 
aacctatatt taaaaatgac ggacagccgg tataaaggga ccacctatga tgtggaacgg 4800 
gaaaaggaca tgatgctatg gctggaagga aagctgcctg ttccaaaggt cctgcacttt 4860 
gaacggcatg atggctggag caatctgctc atgagtgagg ccgatggcgt cctttgctcg 4920 
gaagagtatg aagatgaaca aagccctgaa aagattatcg agctgtatgc ggagtgcatc 4980 
aggctctttc actccatcga catatcggat tgtccctata cgaatagctt agacagccgc 5040 
ttagccgaat tggattactt actgaataac gatctggccg atgtggattg cgaaaactgg 5100 
gaagaagaca ctccatttaa agatccgcgc gagctgtatg attttttaaa gacggaaaag 5160 
cccgaagagg aacttgtctt ttcccacggc gacctgggag acagcaacat ctttgtgaaa 5220 

gatggcaaag taagtggctt tattgatctt gggagaagcg gcagggcgga caagtggtat 5280 
gacattgcct tctgcgtccg gtcgatcagg gaggatatcg gggaagaaca gtatgtcgag 5340 
ctattttttg acttactggg gatcaagcct gattgggaga aaataaaata ttatatttta 5400 
ctggatgaat tgttttagta cctagatgtg gcgcaacgat gccggcgaca agcaggagcg 5460 
caccgacttc ttccgcatca agtgttttgg ctctcaggcc gaggcccacg gcaagtattt 5520 
gggcaagggg tcgctggtat tcgtgcaggg caagattcgg aataccaagt acgagaagga 5580 
cggccagacg gtctacggga ccgacttcat tgccgataag gtggattatc tggacaccaa 5640 
ggcaccaggc gggtcaaatc aggaataagg gcacattgcc ccggcgtgag tcggggcaat 5700 
cccgcaagga gggtgaatga atcggacgtt tgaccggaag gcatacaggc aagaactgat 5760 
cgacgcgggg ttttccgccg aggatgccga aaccatcgca agccgcaccg tcatgcgtgc 5820 
gccccgcgaa accttccagt ccgtcggctc gatggtccag caagctacgg ccaagatcga 5880 
gcgcgacagc gtgcaactgg ctccccctgc cctgcccgcg ccatcggccg ccgtggagcg 5940 
ttcgcgtcgt ctcgaacagg aggcggcagg tttggcgaag tcgatgacca tcgacacgcg 6000 
aggaactatg acgaccaaga agcgaaaaac cgccggcgag gacctggcaa aacaggtcag 6060 
cgaggccaag caggccgcgt tgctgaaaca cacgaagcag cagatcaagg aaatgcagct 6120 
ttccttgttc gatattgcgc cgtggccgga cacgatgcga gcgatgccaa acgacacggc 6180 
ccgctctgcc ctgttcacca cgcgcaacaa gaaaatcccg cgcgaggcgc tgcaaaacaa 6240 
ggtcattttc cacgtcaaca aggacgtgaa gatcacctac accggcgtcg agctgcgggc 6300 
cgacgatgac gaactggtgt ggcagcaggt gttggagtac gcgaagcgca cccctatcgg 6360 
cgagccgatc accttcacgt tctacgagct ttgccaggac ctgggctggt cgatcaatgg 6420 

ccggtattac acgaaggccg aggaatgcct gtcgcgccta caggcgacgg cgatgggctt 6480 

cacgtccgac cgcgttgggc acctggaatc ggtgtcgctg ctgcaccgct tccgcgtcct 6540 

ggaccgtggc aagaaaacgt cccgttgcca ggtcctgatc gacgaggaaa tcgtcgtgct 6600 

gtttgctggc gaccactaca cgaaattcat atgggagaag taccgcaagc tgtcgccgac 6660 

ggcccgacgg atgttcgact atttcagctc gcaccgggag ccgtacccgc tcaagctgga 6720 

aaccttccgc ctcatgtgcg gatcggattc cacccgcgtg aagaagtggc gcgagcaggt 6780 

cggcgaagcc tgcgaagagt tgcgaggcag cggcctggtg gaacacgcct gggtcaatga 6840 

tgacctggtg cattgcaaac gctagggcct tgtggggtca gttccggctg ggggttcagc 6900 
agccagcgct ttactggcat ttcaggaaca agcgggcact gctcgacgca cttgcttcgc 6960 
tcagtatcgc tcgggacgca cggcgcgctc tacgaactgc cgataaacag aggattaaaa 7020 
ttgacaattg tgattaaggc tcagattcga cggcttggag cggccgacgt gcaggatttc 7080 
cgcgagatcc gattgtcggc cctgaagaaa gctccagaga tgttcgggtc cgtttacgag 7140 
cacgaggaga aaaagcccat ggaggcgttc gctgaacggt tgcgagatgc cgtggcattc 7200 
ggcgcctaca tcgacggcga gatcattggg ctgtcggtct tcaaacagga ggacggcccc 7260 
aaggacgctc acaaggcgca tctgtccggc gttttcgtgg agcccgaaca gcgaggccga 7320 
ggggtcgccg gtatgctgct gcgggcgttg ccggcgggtt tattgctcgt gatgatcgtc 7380 
cgacagattc caacgggaat ctggtggatg cgcatcttca tcctcggcgc acttaatatt 7440 
tcgctattct ggagcttgtt gtttatttcg gtctaccgcc tgccgggcgg ggtcgcggcg 7500 
acggtaggcg ctgtgcagcc gctgatggtc gtgttcatct ctgccgctct gctaggtagc 7560 
ccgatacgat tgatggcggt cctgggggct atttgcggaa ctgcgggcgt ggcgctgttg 7620 
gtgttgacac caaacgcagc gctagatcct gtcggcgtcg cagcgggcct ggcgggggcg 7680 
gtttccatgg cgttcggaac cgtgctgacc cgcaagtggc aacctcccgt gcctctgctc 7740 
acctttaccg cctggcaact ggcggccgga ggacttctgc tcgttccagt agctttagtg 7800 
tttgatccgc caatcccgat gcctacagga accaatgttc tcggcctggc gtggctcggc 7860 
ctgatcggag cgggtttaac ctacttcctt tggttccggg ggatctcgcg actcgaacct 7920 
acagttgttt ccttactggg ctttctcagc cccagatctg gggtcgatca gccggggatg 7980 
catcaggccg acagtcggaa cttcgggtcc ccgacctgta ccattcggtg agcaatggat 8040 
aggggagttg atatxgtcaa cgttcacttc taaagaaata gcgccactca gcttcctcag 8100 
cggctttatc cagcgatttc ctattatgtc ggcatagttc tcaagatcga cagcctgtca 8160 
cggttaagcg agaaatgaat aagaaggctg ataattcgga tctctgcgag ggagatgata 8220 
tttgatcaca ggcagcaacg ctctgtcatc gttacaatca acatgctacc ctccgcgaga 8280 
tcatccgtgt ttcaaacccg gcagcttagt tgccgttctt ccgaatagca tcggtaacat 8340 
gagcaaagtc tgccgcctta caacggctct cccgctgacg ccgtcccgga ctgatgggct 8400 
gcctgtatcg agtggtgatt ttgtgccgag ctgccggtcg gggagctgtt ggctggctgg 8460 
tggcaggata tattgtggtg taaacaaatt gacgcttaga caacttaata acacattgcg 8520 
gacgttttta atgtactggg gtggtttttc ttttcaccag tgagacgggc aacagctgat 8580 


tgcccttcac cgcctggccc tgagagagtt gcagcaagcg gtccacgctg gtttgcccca 8640 
gcaggcgaaa atcctgtttg atggtggttc cgaaatcggc aaaatccctt ataaatcaaa 8700 
agaatagccc gagatagggt tgagtgttgt tccagtttgg aacaagagtc cactattaaa 8760 
gaacgtggac tccaacgtca aagggcgaaa aaccgtctat cagggcgatg gcccactacg 8820 
tgaaccatca cccaaatcaa gttttttggg gtcgaggtgc cgtaaagcac taaatcggaa 8880 
ccctaaaggg agcccccgat ttagagcttg acggggaaag ccggcgaacg tggcgagaaa 8940 
ggaagggaag aaagcgaaag gagcgggcgc cattcaggct gcgcaactgt tgggaagggc 9000 
gatcggtgcg ggcctcttcg ctattacgcc agctggcgaa agggggatgt gctgcaaggc 9060 
gattaagttg ggtaacgcca gggttttccc agtcacgacg ttgtaaaacg acggccagtg 9120 
aattaattcc catcttgaaa gaaatatagt ttaaatattt attgataaaa taacaagtca 9180 
ggtattatag tccaagcaaa aacataaatt tattgatgca agtttaaatt cagaaatatt 9240 
tcaataactg attatatcag ctggtacatt gccgtagatg aaagactgag tgcgatatta 9300 
tgtgtaatac ataaattgat gatatagcta gcttagctca tcgggggatc cgtcgaagct 9360 
agcttgggtc ccgctcagaa gaactcgtca agaaggcgat agaaggcgat gcgctgcgaa 9420 
tcgggagcgg cgataccgta aagcacgagg aagcggtcag cccattcgcc gccaagctct 9480 
tcagcaatat cacgggtagc caacgctatg tcctgatagc ggtccgccac acccagccgg 9540 
ccacagtcga tgaatccaga aaagcggcca ttttccacca tgatattcgg caagcaggca 9600 
tcgccatggg tcacgacgag atcctcgccg tcgggcatgc gcgccttgag cctggcgaac 9660 
agttcggctg gcgcgagccc ctgatgctct tcgtccagat catcctgatc gacaagaccg 9720 
gcttccatcc gagtacgtgc tcgctcgatg cgatgtttcg cttggtggtc gaatgggcag 9780 
gtagccggat caagcgtatg cagccgccgc attgcatcag ccatgatgga tactttctcg 9840 
gcaggagcaa ggtgagatga caggagatcc tgccccggca cttcgcccaa tagcagccag 9900 
tcccttcccg cttcagtgac aacgtcgagc acagctgcgc aaggaacgcc cgtcgtggcc 9960 
agccacgata gccgcgctgc ctcgtcctgc agttcattca gggcaccgga caggtcggtc 10020 
ttgacaaaaa gaaccgggcg cccctgcgct gacagccgga acacggcggc atcagagcag 10080 
ccgattgtct gttgtgccca gtcatagccg aatagcctct ccacccaagc ggccggagaa 10140 
cctgcgtgca atccatcttg ttcaatccaa gctcccatgg gccctcgact agagtcgaga 10200 
tctggattga gagtgaatat gagactctaa ttggataccg aggggaattt atggaacgtc 10260 
agtggagcat ttttgacaag aaatatttgc tagctgatag tgaccttagg cgacttttga 10320 
acgcgcaata atggtttctg acgtatgtgc ttagctcatt aaactccaga aacccgcggc 10380 
tgagtggctc cttcaacgtt gcggttctgt cagttccaaa cgtaaaacgg cttgtcccgc 10440 
gtcatcggcg ggggtcataa cgtgactccc ttaattctcc gctcatgatc ttgatcccct 10500 
gcgccatcag atccttggcg gcaagaaagc catccagttt actttgcagg gcttcccaac 10560 
cttaccagag ggcgccccag ctggcaattc cggttcgctt gctgtccata aaaccgccca 10620 
gtctagctat cgccatgtaa gcccactgca agctacctgc tttctctttg cgcttgcgtt 10680 
ttcccttgtc cagatagccc agtagctgac attcatccgg ggtcagcacc gtttctgcgg 10740 
actggctttc tacgtgttcc gcttccttta gcagcccttg cgccctgagt gcttgcggca 10800 
gcgtgaagct tgcatgcctg caggtcgacg gcgcgccgag ctcctcgagc aaatttacac 10860 
attgccacta aacgtctaaa cccttgtaat ttgtttttgt tttactatgt gtgttatgta 10920 

tttgatttgc gataaatttt tatatttggt actaaattta taacaccttt tatgctaacg 10980 

tttgccaaca cttagcaatt tgcaagttga ttaattgatt ctaaattatt tttgtcttct 11040 

aaatacatat actaatcaac tggaaatgta aatatttgct aatatttcta ctataggaga 11100 

attaaagtga gtgaatatgg taccacaagg tttggagatt taattgttgc aatgctgcat 11160 

ggatggcata tacaccaaac attcaataat tcttgaggat aataatggta ccacacaaga 11220 

tttgaggtgc atgaacgtca cgtggacaaa aggtttagta atttttcaag acaacaatgt 11280 

taccacacac aagttttgag gtgcatgcat ggatgccctg tggaaagttt aaaaatattt 11340 

tggaaatgat ttgcatggaa gccatgtgta aaaccatgac atccacttgg aggatgcaat 11400 

aatgaagaaa actacaaatt tacatgcaac tagttatgca tgtagtctat ataatgagga 11460 

ttttgcaata ctttcattca tacacactca ctaagtttta cacgattata atttcttcat 11520 

agccagccca ccgcggtgga aa atg gag gtc gtg gag aga ttc tac ggt gag 11572 

Met Glu Val Val Glu Arg Phe Tyr Gly Glu 
1 5 10 

ttg gat ggg aag gtc tcg cag ggc gtg aat gca ttg ctg ggt agt ttt 11620 
Leu Asp Gly Lys Val Ser Gln Gly Val Asn Ala Leu Leu Gly Ser Phe 
15 20 25 

ggg gtg gag ttg acg gat acg ccc act acc aaa ggc ttg ccc ctc gtt 11668 
Gly Val Glu Leu Thr Asp Thr Pro Thr Thr Lys Gly Leu Pro Leu Val 
30 35 40 

gac agt ccc aca ccc atc gtc ctc ggt gtt tct gta tac ttg act att 11716 
Asp Ser Pro Thr Pro Ile Val Leu Gly Val Ser Val Tyr Leu Thr Ile 
45 50 55 

gtc att gga ggg ctt ttg tgg ata aag gcc agg gat ctg aaa ccg cgc 11764 
Val Ile Gly Gly Leu Leu Trp Ile Lys Ala Arg Asp Leu Lys Pro Arg 
60 65 70 

gcc tcg gag cca ttt ttg ctc caa gct ttg gtg ctt gtg cac aac ctg 11812 
Ala Ser Glu Pro Phe Leu Leu Gln Ala Leu Val Leu Val His Asn Leu 
75 80 85 90 

ttc tgt ttt gcg ctc agt ctg tat atg tgc gtg ggc atc gct tat cag 11860 
Phe Cys Phe Ala Leu Ser Leu Tyr Met Cys Val Gly Ile Ala Tyr Gln 
95 100 105 

gct att acc tgg cgg tac tct ctc tgg ggc aat gca tac aat cct aaa 11908 
Ala Ile Thr Trp Arg Tyr Ser Leu Trp Gly Asn Ala Tyr Asn Pro Lys 
110 115 120 



cat aaa gag atg gcg att ctg gta tac ttg ttc tac atg tct aag tac 11956 
His Lys Glu Met Ala Ile Leu Val Tyr Leu Phe Tyr Met Ser Lys Tyr 
125 130 135 

gtg gaa ttc atg gat acc gtt atc atg ata ctg aag cgc agc acc agg 12004 
Val Glu Phe Met Asp Thr Val Ile Met Ile Leu Lys Arg Ser Thr Arg 
140 145 150 

caa ata agc ttc ctc cac gtt tat cat cat tct tca att tcc ctc att 12052 
Gln Ile Ser Phe Leu His Val Tyr His His Ser Ser Ile Ser Leu Ile 
155 160 165 170 

tgg tgg gct att gct cat cac gct cct ggc ggt gaa gca tat tgg tct 12100 
Trp Trp Ala Ile Ala His His Ala Pro Gly Gly Glu Ala Tyr Trp Ser 
175 180 185 

gcg gct ctg aac tca gga gtg cat gtt ctc atg tat gcg tat tac ttc 12148 
Ala Ala Leu Asn Ser Gly Val His Val Leu Met Tyr Ala Tyr Tyr Phe 
190 195 200 

ttg gct gcc tgc ctt cga agt agc cca aag tta aaa aat aag tac ctt 12196 
Leu Ala Ala Cys Leu Arg Ser Ser Pro Lys Leu Lys Asn Lys Tyr Leu 
205 210 215 

ttt tgg ggc agg tac ttg aca caa ttc caa atg ttc cag ttt atg ctg 12244 
Phe Trp Gly Arg Tyr Leu Thr Gln Phe Gln Met Phe Gln Phe Met Leu 
220 225 230 

aac tta gtg cag gct tac tac gac atg aaa acg aat gcg cca tat cca 12292 
Asn Leu Val Gln Ala Tyr Tyr Asp Met Lys Thr Asn Ala Pro Tyr Pro 
235 240 245 250 

caa tgg ctg atc aag att ttg ttc tac tac atg atc tcg ttg ctg ttt 12340 
Gln Trp Leu Ile Lys Ile Leu Phe Tyr Tyr Met Ile Ser Leu Leu Phe 
255 260 265 

ctt ttc ggc aat ttt tac gta caa aaa tac atc aaa ccc tct gac gga 12388 
Leu Phe Gly Asn Phe Tyr Val Gln Lys Tyr Ile Lys Pro Ser Asp Gly 
270 275 280 

aag caa aag gga gct aaa act gag tga tctagaaggc ctcctgcttt 12435 
Lys Gln Lys Gly Ala Lys Thr Glu 
285 290 

aatgagatat gcgagacgcc tatgatcgca tgatatttgc tttcaattct gttgtgcacg 12495 
ttgtaaaaaa cctgagcatg tgtagctcag atccttaccg ccggtttcgg ttcattctaa 12555 
tgaatatatc acccgttact atcgtatttt tatgaataat attctccgtt caatttactg 12615 
attgtccgtc gagcaaattt acacattgcc actaaacgtc taaacccttg taatttgttt 12675 
ttgttttact atgtgtgtta tgtatttgat ttgcgataaa tttttatatt tggtactaaa 12735 
tttataacac cttttatgct aacgtttgcc aacacttagc aatttgcaag ttgattaatt 12795 
gattctaaat tatttttgtc ttctaaatac atatactaat caactggaaa tgtaaatatt 12855 
tgctaatatt tctactatag gagaattaaa gtgagtgaat atggtaccac aaggtttgga 12915 

gatttaattg ttgcaatgct gcatggatgg catatacacc aaacattcaa taattcttga 12975 

ggataataat ggtaccacac aagatttgag gtgcatgaac gtcacgtgga caaaaggttt 13035 

agtaattttt caagacaaca atgttaccac acacaagttt tgaggtgcat gcatggatgc 13095 

cctgtggaaa gtttaaaaat attttggaaa tgatttgcat ggaagccatg tgtaaaacca 13155 

tgacatccac ttggaggatg caataatgaa gaaaactaca aatttacatg caactagtta 13215 

tgcatgtagt ctatataatg aggattttgc aatactttca ttcatacaca ctcactaagt 13275 

tttacacgat tataatttct tcatagccag cggatcc atg gta ttc gcg ggc ggt 13330 
Met Val Phe Ala Gly Gly 
295 

gga ctt cag cag ggc tct ctc gaa gaa aac atc gac gtc gag cac att 13378 
Gly Leu Gln Gln Gly Ser Leu Glu Glu Asn Ile Asp Val Glu His Ile 
300 305 310 

gcc agt atg tct ctc ttc agc gac ttc ttc agt tat gtg tct tca act 13426 
Ala Ser Met Ser Leu Phe Ser Asp Phe Phe Ser Tyr Val Ser Ser Thr 
315 320 325 

gtt ggt tcg tgg agc gta cac agt ata caa cct ttg aag cgc ctg acg 13474 
Val Gly Ser Trp Ser Val His Ser Ile Gln Pro Leu Lys Arg Leu Thr 
330 335 340 345 

agt aag aag cgt gtt tcg gaa agc gct gcc gtg caa tgt ata tca gct 13522 
Ser Lys Lys Arg Val Ser Glu Ser Ala Ala Val Gln Cys Ile Ser Ala 
350 355 360 

gaa gtt cag aga aat tcg agt acc cag gga act gcg gag gca ctc gca 13570 
Glu Val Gln Arg Asn Ser Ser Thr Gln Gly Thr Ala Glu Ala Leu Ala 
365 370 375 

gaa tca gtc gtg aag ccc acg aga cga agg tca tct cag tgg aag aag 13618 
Glu Ser Val Val Lys Pro Thr Arg Arg Arg Ser Ser Gln Trp Lys Lys 
380 385 390 

tcg aca cac ccc cta tca gaa gta gca gta cac aac aag cca agc gat 13666 
Ser Thr His Pro Leu Ser Glu Val Ala Val His Asn Lys Pro Ser Asp 
395 400 405 

tgc tgg att gtt gta aaa aac aag gtg tat gat gtt tcc aat ttt gcg 13714 
Cys Trp Ile Val Val Lys Asn Lys Val Tyr Asp Val Ser Asn Phe Ala 
410 415 420 425 

gac gag cat ccc gga gga tca gtt att agt act tat ttt gga cga gac 13762 
Asp Glu His Pro Gly Gly Ser Val Ile Ser Thr Tyr Phe Gly Arg Asp 
430 435 440 

ggc aca gat gtt ttc tct agt ttt cat gca gct tct aca tgg aaa att 13810 
Gly Thr Asp Val Phe Ser Ser Phe His Ala Ala Ser Thr Trp Lys Ile 
445 450 455 

ctt caa gac ttt tac att ggt gac gtg gag agg gtg gag ccg act cca 13858 
Leu Gln Asp Phe Tyr Ile Gly Asp Val Glu Arg Val Glu Pro Thr Pro 
460 465 470 

gag ctg ctg aaa gat ttc cga gaa atg aga gct ctt ttc ctg agg gag 13906 
Glu Leu Leu Lys Asp Phe Arg Glu Met Arg Ala Leu Phe Leu Arg Glu 
475 480 485 

caa ctt ttc aaa agt tcg aaa ttg tac tat gtt atg aag ctg ctc acg 13954 
Gln Leu Phe Lys Ser Ser Lys Leu Tyr Tyr Val Met Lys Leu Leu Thr 
490 495 500 505 

aat gtt gct att ttt gct gcg agc att gca ata ata tgt tgg agc aag 14002 
Asn Val Ala Ile Phe Ala Ala Ser Ile Ala Ile Ile Cys Trp Ser Lys 
510 515 520 

act att tca gcg gtt ttg gct tca gct tgt atg atg gct ctg tgt ttc 14050 
Thr Ile Ser Ala Val Leu Ala Ser Ala Cys Met Met Ala Leu Cys Phe 
525 530 535 

caa cag tgc gga tgg cta tcc cat gat ttt ctc cac aat cag gtg ttt 14098
Gln Gln Cys Gly Trp Leu Ser His Asp Phe Leu His Asn Gln Val Phe
540 545 550

gag aca cgc tgg ctt aat gaa gtt gtc ggg tat gtg atc ggc aac gcc 14146
Glu Thr Arg Trp Leu Asn Glu Val Val Gly Tyr Val Ile Gly Asn Ala
555 560 565

gtt ctg ggg ttt agt aca ggg tgg tgg aag gag aag cat aac ctt cat 14194
Val Leu Gly Phe Ser Thr Gly Trp Trp Lys Glu Lys His Asn Leu His
570 575 580 585

cat gct gct cca aat gaa tgc gat cag act tac caa cca att gat gaa 14242
His Ala Ala Pro Asn Glu Cys Asp Gln Thr Tyr Gln Pro Ile Asp Glu
590 595 600

gat att gat act ctc ccc ctc att gcc tgg agc aag gac ata ctg gcc 14290
Asp Ile Asp Thr Leu Pro Leu Ile Ala Trp Ser Lys Asp Ile Leu Ala
605 610 615

aca gtt gag aat aag aca ttc ttg cga atc ctc caa tac cag cat ctg 14338
Thr Val Glu Asn Lys Thr Phe Leu Arg Ile Leu Gln Tyr Gln His Leu
620 625 630

ttc ttc atg ggt ctg tta ttt ttc gcc cgt ggt agt tgg ctc ttt tgg 14386
Phe Phe Met Gly Leu Leu Phe Phe Ala Arg Gly Ser Trp Leu Phe Trp
635 640 645

agc tgg aga tat acc tct aca gca gtg ctc tca cct gtc gac agg ttg 14434
Ser Trp Arg Tyr Thr Ser Thr Ala Val Leu Ser Pro Val Asp Arg Leu
650 655 660 665

ttg gag aag gga act gtt ctg ttt cac tac ttt tgg ttc gtc ggg aca 14482
Leu Glu Lys Gly Thr Val Leu Phe His Tyr Phe Trp Phe Val Gly Thr
670 675 680

gcg tgc tat ctt ctc cct ggt tgg aag cca tta gta tgg atg gcg gtg 14530
Ala Cys Tyr Leu Leu Pro Gly Trp Lys Pro Leu Val Trp Met Ala Val
685 690 695

act gag ctc atg tcc ggc atg ctg ctg ggc ttt gta ttt gta ctt agc 14578
Thr Glu Leu Met Ser Gly Met Leu Leu Gly Phe Val Phe Val Leu Ser
700 705 710

cac aat ggg atg gag gtt tat aat tcg tct aaa gaa ttc gtg agt gca 14626
His Asn Gly Met Glu Val Tyr Asn Ser Ser Lys Glu Phe Val Ser Ala
715 720 725

cag atc gta tcc aca cgg gat atc aaa gga aac ata ttc aac gac tgg 14674
Gln Ile Val Ser Thr Arg Asp Ile Lys Gly Asn Ile Phe Asn Asp Trp
730 735 740 745

ttc act ggt ggc ctt aac agg caa ata gag cat cat ctt ttc cca aca 14722
Phe Thr Gly Gly Leu Asn Arg Gln Ile Glu His His Leu Phe Pro Thr
750 755 760

atg ccc agg cat aat tta aac aaa ata gca cct aga gtg gag gtg ttc 14770
Met Pro Arg His Asn Leu Asn Lys Ile Ala Pro Arg Val Glu Val Phe
765 770 775

tgt aag aaa cac ggt ctg gtg tac gaa gac gta tct att gct acc ggc 14818
Cys Lys Lys His Gly Leu Val Tyr Glu Asp Val Ser Ile Ala Thr Gly
780 785 790

act tgc aag gtt ttg aaa gca ttg aag gaa gtc gcg gag gct gcg gca 14866
Thr Cys Lys Val Leu Lys Ala Leu Lys Glu Val Ala Glu Ala Ala Ala
795 800 805

gag cag cat gct acc acc agt taa gctagcgtta accctgcttt aatgagatat 14920
Glu Gln His Ala Thr Thr Ser
810 815
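
The translated rows above pair each codon with its three-letter amino acid. As a rough cross-check, the last coding row can be translated with the standard genetic code. The Python sketch below is illustrative only: the names `CODON_TABLE`, `THREE_LETTER` and `translate` are hypothetical helpers, not part of the sequence listing, and the fourth codon of the row is read as `gct`, on the assumption that any other spelling in the scanned text is an OCR artifact.

```python
from itertools import product

# Standard genetic code; codons are ordered T, C, A, G at each position,
# with the third base varying fastest.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

THREE_LETTER = {
    "A": "Ala", "R": "Arg", "N": "Asn", "D": "Asp", "C": "Cys",
    "Q": "Gln", "E": "Glu", "G": "Gly", "H": "His", "I": "Ile",
    "L": "Leu", "K": "Lys", "M": "Met", "F": "Phe", "P": "Pro",
    "S": "Ser", "T": "Thr", "W": "Trp", "Y": "Tyr", "V": "Val",
}

def translate(dna):
    """Translate a coding sequence up to (not including) the first stop codon."""
    seq = dna.replace(" ", "").upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the reading frame
            break
        residues.append(THREE_LETTER[aa])
    return residues

# Last coding row of the first open reading frame shown above.
last_row = "gag cag cat gct acc acc agt taa"
print(" ".join(translate(last_row)))
```

Comparing the printed residues with the amino acid line of a row is a quick way to spot scanning errors in either line.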

gcgagacgcc tatgatcgca tgatatttgc tttcaattct gttgtgcacg ttgtaaaaaa 14980

cctgagcatg tgtagctcag atccttaccg ccggtttcgg ttcattctaa tgaatatatc 15040

acccgttact atcgtatttt tatgaataat attctccgtt caatttactg attgtccgtc 15100
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gagcaaattt acacattgcc actaaacgtc taaacccttg taatttgttt ttgttttact 15160 
atgtgtgtta tgtatttgat ttgcgataaa tttttatatt tggtactaaa tttataacac 15220 
cttttatgct aacgtttgcc aacacttagc aatttgcaag ttgattaatt gattctaaat 15280 
tatttttgtc ttctaaatac atatactaat caactggaaa tgtaaatatt tgctaatatt 15340 
tctactatag gagaattaaa gtgagtgaat atggtaccac aaggtttgga gatttaattg 15400 
ttgcaatgct gcatggatgg catatacacc aaacattcaa taattcttga ggataataat 15460 
ggtaccacac aagatttgag gtgcatgaac gtcacgtgga caaaaggttt agtaattttt 15520 
caagacaaca atgttaccac acacaagttt tgaggtgcat gcatggatgc cctgtggaaa 15580 


gtttaaaaat attttggaaa tgatttgcat ggaagccatg tgtaaaacca tgacatccac 15640 

ttggaggatg caataatgaa gaaaactaca aatttacatg caactagtta tgcatgtagt 15700 

ctatataatg aggattttgc aatactttca ttcatacaca ctcactaagt tttacacgat 15760 

tataatttct tcatagccag cagatctaaa atg gct ccg gat gcg gat aag ctt 15814
Met Ala Pro Asp Ala Asp Lys Leu
820 825

cga caa cgc cag acg act gcg gta gcg aag cac aat gct gct acc ata 15862
Arg Gln Arg Gln Thr Thr Ala Val Ala Lys His Asn Ala Ala Thr Ile
830 835 840

tcg acg cag gaa cgc ctt tgc agt ctg tct tcg ctc aaa ggc gaa gaa 15910
Ser Thr Gln Glu Arg Leu Cys Ser Leu Ser Ser Leu Lys Gly Glu Glu
845 850 855

gtc tgc atc gac gga atc atc tat gac ctc caa tca ttc gat cat ccc 15958
Val Cys Ile Asp Gly Ile Ile Tyr Asp Leu Gln Ser Phe Asp His Pro
860 865 870

ggg ggt gaa acg atc aaa atg ttt ggt ggc aac gat gtc act gta cag 16006
Gly Gly Glu Thr Ile Lys Met Phe Gly Gly Asn Asp Val Thr Val Gln
875 880 885

tac aag atg att cac ccg tac cat acc gag aag cat ttg gaa aag atg 16054
Tyr Lys Met Ile His Pro Tyr His Thr Glu Lys His Leu Glu Lys Met
890 895 900 905

aag cgt gtc ggc aag gtg acg gat ttc gtc tgc gag tac aag ttc gat 16102
Lys Arg Val Gly Lys Val Thr Asp Phe Val Cys Glu Tyr Lys Phe Asp
910 915 920

acc gaa ttt gaa cgc gaa atc aaa cga gaa gtc ttc aag att gtg cga 16150
Thr Glu Phe Glu Arg Glu Ile Lys Arg Glu Val Phe Lys Ile Val Arg
925 930 935

cga ggc aag gat ttc ggt act ttg gga tgg ttc ttc cgt gcg ttt tgc 16198
Arg Gly Lys Asp Phe Gly Thr Leu Gly Trp Phe Phe Arg Ala Phe Cys
940 945 950

tac att gcc att ttc ttc tac ctg cag tac cat tgg gtc acc acg gga 16246
Tyr Ile Ala Ile Phe Phe Tyr Leu Gln Tyr His Trp Val Thr Thr Gly
955 960 965

acc tct tgg ctg ctg gcc gtg gcc tac gga atc tcc caa gcg atg att 16294
Thr Ser Trp Leu Leu Ala Val Ala Tyr Gly Ile Ser Gln Ala Met Ile
970 975 980 985

ggc atg aat gtc cag cac gat gcc aac cac ggg gcc acc tcc aag cgt 16342
Gly Met Asn Val Gln His Asp Ala Asn His Gly Ala Thr Ser Lys Arg
990 995 1000

ccc tgg gtc aac gac atg cta ggc ctc ggt gcg gat ttt att ggt ggt 16390
Pro Trp Val Asn Asp Met Leu Gly Leu Gly Ala Asp Phe Ile Gly Gly
1005 1010 1015

tcc aag tgg ctc tgg cag gaa caa cac tgg acc cac cac gct tac acc 16438
Ser Lys Trp Leu Trp Gln Glu Gln His Trp Thr His His Ala Tyr Thr
1020 1025 1030

aat cac gcc gag atg gat ccc gat agc ttt ggt gcc gaa cca atg ctc 16486
Asn His Ala Glu Met Asp Pro Asp Ser Phe Gly Ala Glu Pro Met Leu
1035 1040 1045

cta ttc aac gac tat ccc ttg gat cat ccc gct cgt acc tgg cta cat 16534
Leu Phe Asn Asp Tyr Pro Leu Asp His Pro Ala Arg Thr Trp Leu His
1050 1055 1060 1065

cgc ttt caa gca ttc ttt tac atg ccc gtc ttg gct gga tac tgg ttg 16582
Arg Phe Gln Ala Phe Phe Tyr Met Pro Val Leu Ala Gly Tyr Trp Leu
1070 1075 1080

tcc gct gtc ttc aat cca caa att ctt gac ctc cag caa cgc ggc gca 16630
Ser Ala Val Phe Asn Pro Gln Ile Leu Asp Leu Gln Gln Arg Gly Ala
1085 1090 1095

ctt tcc gtc ggt atc cgt ctc gac aac gct ttc att cac tcg cga cgc 16678
Leu Ser Val Gly Ile Arg Leu Asp Asn Ala Phe Ile His Ser Arg Arg
1100 1105 1110

aag tat gcg gtt ttc tgg cgg gct gtg tac att gcg gtg aac gtg att 16726
Lys Tyr Ala Val Phe Trp Arg Ala Val Tyr Ile Ala Val Asn Val Ile
1115 1120 1125

gct ccg ttt tac aca aac tcc ggc ctc gaa tgg tcc tgg cgt gtc ttt 16774
Ala Pro Phe Tyr Thr Asn Ser Gly Leu Glu Trp Ser Trp Arg Val Phe
1130 1135 1140 1145

gga aac atc atg ctc atg ggt gtg gcg gaa tcg ctc gcg ctg gcg gtc 16822
Gly Asn Ile Met Leu Met Gly Val Ala Glu Ser Leu Ala Leu Ala Val
1150 1155 1160

ctg ttt tcg ttg tcg cac aat ttc gaa tcc gcg gat cgc gat ccg acc 16870
Leu Phe Ser Leu Ser His Asn Phe Glu Ser Ala Asp Arg Asp Pro Thr
1165 1170 1175

gcc cca ctg aaa aag acg gga gaa cca gtc gac tgg ttc aag aca cag 16918
Ala Pro Leu Lys Lys Thr Gly Glu Pro Val Asp Trp Phe Lys Thr Gln
1180 1185 1190

gtc gaa act tcc tgc act tac ggt gga ttc ctt tcc ggt tgc ttc acg 16966
Val Glu Thr Ser Cys Thr Tyr Gly Gly Phe Leu Ser Gly Cys Phe Thr
1195 1200 1205

gga ggt ctc aac ttt cag gtt gaa cac cac ttg ttc cca cgc atg agc 17014
Gly Gly Leu Asn Phe Gln Val Glu His His Leu Phe Pro Arg Met Ser
1210 1215 1220 1225

agc gct tgg tat ccc tac att gcc ccc aag gtc cgc gaa att tgc gcc 17062
Ser Ala Trp Tyr Pro Tyr Ile Ala Pro Lys Val Arg Glu Ile Cys Ala
1230 1235 1240

aaa cac ggc gtc cac tac gcc tac tac ccg tgg atc cac caa aac ttt 17110
Lys His Gly Val His Tyr Ala Tyr Tyr Pro Trp Ile His Gln Asn Phe
1245 1250 1255

ctc tcc acc gtc cgc tac atg cac gcg gcc ggg acc ggt gcc aac tgg 17158
Leu Ser Thr Val Arg Tyr Met His Ala Ala Gly Thr Gly Ala Asn Trp
1260 1265 1270

cgc cag atg gcc aga gaa aat ccc ttg acc gga cgg gcg taa 17200
Arg Gln Met Ala Arg Glu Asn Pro Leu Thr Gly Arg Ala
1275 1280 1285

agatctgccg gcatcgatcc cgggccatgg cctgctttaa tgagatatgc gagacgccta 17260
tgatcgcatg atatttgctt tcaattctgt tgtgcacgtt gtaaaaaacc tgagcatgtg 17320
tagctcagat ccttaccgcc ggtttcggtt cattctaatg aatatatcac ccgttactat 17380
cgtattttta tgaataatat tctccgttca atttactgat tgtccgtcga cgagctcggc 17440
gcgcctctag aggatcgatg aattcagatc ggctgagtgg ctccttcaac gttgcggttc 17500
tgtcagttcc aaacgraaaa cggcttgtcc cgcgtcatcg gcgggggtca taacgtgact 17560
cccttaattc tccgctcatg atcagattgt cgtttcccgc cttcagttta aactatcagt 17620
gtttgacagg atatattggc gggtaaacct aagagaaaag agcgtttatt agaataatcg 17680
gatatttaaa agggcgtgaa aaggtttatc cttcgtccat ttgtatgtgc atgccaacca 17740
cagggttccc ca 17752



<210> 47 

<211> 290 

<212> PRT 

<213> Unknown 

<400> 47 

Met Glu Val Val Glu Arg Phe Tyr Gly Glu Leu Asp Gly Lys Val Ser
1 5 10 15

Gln Gly Val Asn Ala Leu Leu Gly Ser Phe Gly Val Glu Leu Thr Asp
20 25 30

Thr Pro Thr Thr Lys Gly Leu Pro Leu Val Asp Ser Pro Thr Pro Ile
35 40 45

Val Leu Gly Val Ser Val Tyr Leu Thr Ile Val Ile Gly Gly Leu Leu
50 55 60

Trp Ile Lys Ala Arg Asp Leu Lys Pro Arg Ala Ser Glu Pro Phe Leu
65 70 75 80

Leu Gln Ala Leu Val Leu Val His Asn Leu Phe Cys Phe Ala Leu Ser
85 90 95

Leu Tyr Met Cys Val Gly Ile Ala Tyr Gln Ala Ile Thr Trp Arg Tyr
100 105 110

Ser Leu Trp Gly Asn Ala Tyr Asn Pro Lys His Lys Glu Met Ala Ile
115 120 125

Leu Val Tyr Leu Phe Tyr Met Ser Lys Tyr Val Glu Phe Met Asp Thr
130 135 140

Val Ile Met Ile Leu Lys Arg Ser Thr Arg Gln Ile Ser Phe Leu His
145 150 155 160

Val Tyr His His Ser Ser Ile Ser Leu Ile Trp Trp Ala Ile Ala His
165 170 175

His Ala Pro Gly Gly Glu Ala Tyr Trp Ser Ala Ala Leu Asn Ser Gly
180 185 190

Val His Val Leu Met Tyr Ala Tyr Tyr Phe Leu Ala Ala Cys Leu Arg
195 200 205

Ser Ser Pro Lys Leu Lys Asn Lys Tyr Leu Phe Trp Gly Arg Tyr Leu
210 215 220

Thr Gln Phe Gln Met Phe Gln Phe Met Leu Asn Leu Val Gln Ala Tyr
225 230 235 240

Tyr Asp Met Lys Thr Asn Ala Pro Tyr Pro Gln Trp Leu Ile Lys Ile
245 250 255

Leu Phe Tyr Tyr Met Ile Ser Leu Leu Phe Leu Phe Gly Asn Phe Tyr
260 265 270

Val Gln Lys Tyr Ile Lys Pro Ser Asp Gly Lys Gln Lys Gly Ala Lys
275 280 285

Thr Glu
290



<210> 48 
<211> 525 
<212> PRT 
<213> Unknown 

<400> 48 
Met Val Phe Ala Gly Gly Gly Leu Gln Gln Gly Ser Leu Glu Glu Asn
1 5 10 15

Ile Asp Val Glu His Ile Ala Ser Met Ser Leu Phe Ser Asp Phe Phe
20 25 30

Ser Tyr Val Ser Ser Thr Val Gly Ser Trp Ser Val His Ser Ile Gln
35 40 45



Pro Leu Lys Arg Leu Thr Ser Lys Lys Arg Val Ser Glu Ser Ala Ala
50 55 60

Val Gln Cys Ile Ser Ala Glu Val Gln Arg Asn Ser Ser Thr Gln Gly
65 70 75 80

Thr Ala Glu Ala Leu Ala Glu Ser Val Val Lys Pro Thr Arg Arg Arg
85 90 95

Ser Ser Gln Trp Lys Lys Ser Thr His Pro Leu Ser Glu Val Ala Val
100 105 110

His Asn Lys Pro Ser Asp Cys Trp Ile Val Val Lys Asn Lys Val Tyr
115 120 125

Asp Val Ser Asn Phe Ala Asp Glu His Pro Gly Gly Ser Val Ile Ser
130 135 140

Thr Tyr Phe Gly Arg Asp Gly Thr Asp Val Phe Ser Ser Phe His Ala
145 150 155 160

Ala Ser Thr Trp Lys Ile Leu Gln Asp Phe Tyr Ile Gly Asp Val Glu
165 170 175

Arg Val Glu Pro Thr Pro Glu Leu Leu Lys Asp Phe Arg Glu Met Arg
180 185 190

Ala Leu Phe Leu Arg Glu Gln Leu Phe Lys Ser Ser Lys Leu Tyr Tyr
195 200 205

Val Met Lys Leu Leu Thr Asn Val Ala Ile Phe Ala Ala Ser Ile Ala
210 215 220

Ile Ile Cys Trp Ser Lys Thr Ile
225 230

Ala Val Leu Ala Ser Ala Cys
235 240

Met Met Ala Leu Cys Phe Gln Gln Cys Gly Trp Leu Ser His Asp Phe
245 250 255



Leu His Asn Gln Val Phe Glu Thr Arg Trp Leu Asn Glu Val Val Gly
260 265 270

Tyr Val Ile Gly Asn Ala Val Leu Gly Phe Ser Thr Gly Trp Trp Lys
275 280 285

Glu Lys His Asn Leu His His Ala Ala Pro Asn Glu Cys Asp Gln Thr
290 295 300

Tyr Gln Pro Ile Asp Glu Asp Ile Asp Thr Leu Pro Leu Ile Ala Trp
305 310 315 320

Ser Lys Asp Ile Leu Ala Thr Val Glu Asn Lys Thr Phe Leu Arg Ile
325 330 335

Leu Gln Tyr Gln His Leu Phe Phe Met Gly Leu Leu Phe Phe Ala Arg
340 345 350

Gly Ser Trp Leu Phe Trp Ser Trp Arg Tyr Thr Ser Thr Ala Val Leu
355 360 365

Ser Pro Val Asp Arg Leu Leu Glu Lys Gly Thr Val Leu Phe His Tyr
370 375 380

Phe Trp Phe Val Gly Thr Ala Cys Tyr Leu Leu Pro Gly Trp Lys Pro
385 390 395 400

Leu Val Trp Met Ala Val Thr Glu Leu Met Ser Gly Met Leu Leu Gly
405 410 415

Phe Val Phe Val Leu Ser His Asn Gly Met Glu Val Tyr Asn Ser Ser
420 425 430

Lys Glu Phe Val Ser Ala Gln Ile Val Ser Thr Arg Asp Ile Lys Gly
435 440 445

Asn Ile Phe Asn Asp Trp Phe Thr Gly Gly Leu Asn Arg Gln Ile Glu
450 455 460

His His Leu Phe Pro Thr Met Pro Arg His Asn Leu Asn Lys Ile Ala
465 470 475 480

Pro Arg Val Glu Val Phe Cys Lys Lys His Gly Leu Val Tyr Glu Asp
485 490 495

Val Ser Ile Ala Thr Gly Thr Cys Lys Val Leu Lys Ala Leu Lys Glu
500 505 510

Val Ala Glu Ala Ala Ala Glu Gln His Ala Thr Thr Ser
515 520 525



<210> 49 
<211> 469 
<212> PRT 
<213> Unknown 

<400> 49 
Met Ala Pro Asp Ala Asp Lys Leu Arg Gln Arg Gln Thr Thr Ala Val
1 5 10 15

Ala Lys His Asn Ala Ala Thr Ile Ser Thr Gln Glu Arg Leu Cys Ser
20 25 30

Leu Ser Ser Leu Lys Gly Glu Glu Val Cys Ile Asp Gly Ile Ile Tyr
35 40 45

Asp Leu Gln Ser Phe Asp His Pro Gly Gly Glu Thr Ile Lys Met Phe
50 55 60

Gly Gly Asn Asp Val Thr Val Gln Tyr Lys Met Ile His Pro Tyr His
65 70 75 80

Thr Glu Lys His Leu Glu Lys Met Lys Arg Val Gly Lys Val Thr Asp
85 90 95

Phe Val Cys Glu Tyr Lys Phe Asp Thr Glu Phe Glu Arg Glu Ile Lys
100 105 110

Arg Glu Val Phe Lys Ile Val Arg Arg Gly Lys Asp Phe Gly Thr Leu
115 120 125

Gly Trp Phe Phe Arg Ala Phe Cys Tyr Ile Ala Ile Phe Phe Tyr Leu
130 135 140

Gln Tyr His Trp Val Thr Thr Gly Thr Ser Trp Leu Leu Ala Val Ala
145 150 155 160

Tyr Gly Ile Ser Gln Ala Met Ile Gly Met Asn Val Gln His Asp Ala
165 170 175

Asn His Gly Ala Thr Ser Lys Arg Pro Trp Val Asn Asp Met Leu Gly
180 185 190

Leu Gly Ala Asp Phe Ile Gly Gly Ser Lys Trp Leu Trp Gln Glu Gln
195 200 205

His Trp Thr His His Ala Tyr Thr Asn His Ala Glu Met Asp Pro Asp
210 215 220

Ser Phe Gly Ala Glu Pro Met Leu Leu Phe Asn Asp Tyr Pro Leu Asp
225 230 235 240

His Pro Ala Arg Thr Trp Leu His Arg Phe Gln Ala Phe Phe Tyr Met
245 250 255

Pro Val Leu Ala Gly Tyr Trp Leu Ser Ala Val Phe Asn Pro Gln Ile
260 265 270

Leu Asp Leu Gln Gln Arg Gly Ala Leu Ser Val Gly Ile Arg Leu Asp
275 280 285

Asn Ala Phe Ile His Ser Arg Arg Lys Tyr Ala Val Phe Trp Arg Ala
290 295 300

Val Tyr Ile Ala Val Asn Val Ile Ala Pro Phe Tyr Thr Asn Ser Gly
305 310 315 320

Leu Glu Trp Ser Trp Arg Val Phe Gly Asn Ile Met Leu Met Gly Val
325 330 335

Ala Glu Ser Leu Ala Leu Ala Val Leu Phe Ser Leu Ser His Asn Phe
340 345 350

Glu Ser Ala Asp Arg Asp Pro Thr Ala Pro Leu Lys Lys Thr Gly Glu
355 360 365

Pro Val Asp Trp Phe Lys Thr Gln Val Glu Thr Ser Cys Thr Tyr Gly
370 375 380

Gly Phe Leu Ser Gly Cys Phe Thr Gly Gly Leu Asn Phe Gln Val Glu
385 390 395 400

His His Leu Phe Pro Arg Met Ser Ser Ala Trp Tyr Pro Tyr Ile Ala
405 410 415

Pro Lys Val Arg Glu Ile Cys Ala Lys His Gly Val His Tyr Ala Tyr
420 425 430

Tyr Pro Trp Ile His Gln Asn Phe Leu Ser Thr Val Arg Tyr Met His
435 440 445

Ala Ala Gly Thr Gly Ala Asn Trp Arg Gln Met Ala Arg Glu Asn Pro
450 455 460

Leu Thr Gly Arg Ala
465



<210> 50 
<211> 26 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Polylinker 



<400> 50 

gaattcggcg cgccgagctc ctcgag 
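
SEQ ID NO: 50 is short enough to inspect directly. The Python sketch below checks its length against the `<211>` field and locates four restriction sites in it. The enzyme table is an assumption added for illustration (the listing itself names no enzymes), although the recognition sequences given are the standard ones for EcoRI, AscI, SacI and XhoI; `find_sites` is a hypothetical helper.

```python
# SEQ ID NO: 50 as printed in the listing, with the spacing removed.
polylinker = "gaattcggcg cgccgagctc ctcgag".replace(" ", "").upper()

# Assumed enzyme table (illustrative; not stated in the sequence listing).
SITES = {
    "EcoRI": "GAATTC",
    "AscI": "GGCGCGCC",
    "SacI": "GAGCTC",
    "XhoI": "CTCGAG",
}

def find_sites(seq, sites):
    """Return {enzyme: [0-based start positions]} for every site found in seq."""
    hits = {}
    for enzyme, site in sites.items():
        positions = []
        start = seq.find(site)
        while start != -1:
            positions.append(start)
            start = seq.find(site, start + 1)  # allow overlapping matches
        if positions:
            hits[enzyme] = positions
    return hits

print(len(polylinker))                # length to compare with <211>
print(find_sites(polylinker, SITES))  # where each site starts
```

Under these assumptions the four sites tile the polylinker end to end, which is consistent with the declared length of 26.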
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<210> 51 
<211> 265 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Polylinker-terminator-polylinker 
<400> 51 

ccaccgcggt gggcggccgc ctgcagtcta gaaggcctcc tgctttaatg agatatgcga 60 
gacgcctatg atcgcatgat atttgctttc aattctgttg tgcacgttgt aaaaaacctg 120 
agcatgtgta gctcagatcc ttaccgccgg tttcggttca ttctaatgaa tatatcaccc 180
gttactatcg tatttttatg aataatattc tccgttcaat ttactgattg tccgtcgacg 240 
aattcgagct cggcgcgcca agctt 265 

<210> 52 
<211> 257 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Polylinker-terminator-polylinker
<400> 52 

ggatccgata tcgggcccgc tagcgttaac cctgctttaa tgagatatgc gagacgccta 60 
tgatcgcatg atatttgctt tcaattctgt tgtgcacgtt gtaaaaaacc tgagcatgtg 120 
tagctcagat ccttaccgcc ggtttcggtt cattctaatg aatatatcac ccgttactat 180 
cgtattttta tgaataatat tctccgttca atttactgat tgtccgtcga cgaattcgag 240
ctcggcgcgc caagctt 257
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* 

<210> 53 
<211> 257 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Polylinker-terminator-polylinker 
<400> 53 

agatctgccg gcatcgatcc cgggccatgg cctgctttaa tgagatatgc gagacgccta 60 
tgatcgcatg atatttgctt tcaattctgt tgtgcacgtt gtaaaaaacc tgagcatgtg 120 
tagctcagat ccttaccgcc ggtttcggtt cattctaatg aatatatcac ccgttactat 180 
cgtattttta tgaataatat tctccgttca atttactgat tgtccgtcga cgaattcgag 240
ctcggcgcgc caagctt 257
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We claim: 



1. A process for the production of compounds of the general
formula I:

    R1-C(=O)-[CH2]m-[CH=CH-CH2]n-[CH2]p-CH3    (I)

in transgenic plants with a content of at least 1% by weight
based on the total fatty acids, which process comprises the
following steps:

a) introducing, into a plant, at least one nucleic acid
sequence which encodes a polypeptide with a
Δ6-desaturase activity; and

b) introducing at least one second nucleic acid sequence
which encodes a polypeptide with a Δ6-elongase activity;
and,

c) if appropriate, introducing a third nucleic acid sequence
which encodes a polypeptide with a Δ5-desaturase
activity;

d) followed by growing and harvesting the plants; and

where the variables and substituents in the formula I have
the following meanings:

R1 = -OH, coenzyme A (thioester), phosphatidylcholine,
phosphatidylethanolamine, phosphatidylglycerol,
diphosphatidylglycerol, phosphatidylserine,
phosphatidylinositol, sphingolipid, glycosphingolipid or a
radical of the following general formula II

    H2C-O-R2
    |
    HC-O-R3     (II)
    |
    H2C-O-

R2 = H, phosphatidylcholine-, phosphatidylethanolamine-,
phosphatidylglycerol-, diphosphatidylglycerol-,
phosphatidylserine-, phosphatidylinositol-, sphingolipid-,
glycosphingolipid- or saturated or
unsaturated C2-C24-alkylcarbonyl-,

R3 = H, saturated or unsaturated C2-C24-alkylcarbonyl-, or

R2 and R3 independently of one another represent a radical of
the general formula Ia

    -C(=O)-[CH2]m-[CH=CH-CH2]n-[CH2]p-CH3    (Ia),

n = 3, 4 or 6, m = 3, 4 or 5 and p = 0 or 3.

2. The process according to claim 1, wherein the substituents R2
and R3 independently of one another are
C10-C22-alkylcarbonyl-.

3. The process according to claim 1 or 2, wherein the
substituents R2 and R3 independently of one another are C16-,
C18-, C20- or C22-alkylcarbonyl-.

4. The method according to any of claims 1 to 3, wherein the
substituents R2 and R3 independently of one another are
unsaturated C16-, C18-, C20- or C22-alkylcarbonyl- with one,
two, three, four or five double bonds.

5. The method according to any of claims 1 to 4, wherein the
transgenic plant is an oil crop.

6. The method according to any of claims 1 to 5, wherein the
transgenic plant is selected from the group consisting of
soya, peanut, oilseed rape, canola, linseed, evening
primrose, verbascum, thistle, hazelnut, almond, macadamia,
avocado, bay, wild roses, pumpkin/squash, pistachios, sesame,
sunflower, safflower, borage, maize, poppy, mustard, hemp,
castor-oil plant, olive, Calendula, Punica, oil palm, walnut
or coconut.

7. The method according to any of claims 1 to 6, wherein the
compounds of the formula I are obtained from the transgenic
plants in the form of their oils, fats, lipids or free fatty
acids by pressing or extraction.

8. The process according to any of claims 1 to 7, wherein the
oils, fats, lipids or free fatty acids obtained as claimed in
claim 7 are refined.

9. The process according to any of claims 1 to 8, wherein the
saturated or unsaturated fatty acids present in the compounds
of the formula I are liberated.

10. The method according to any of claims 1 to 9, wherein the
saturated or unsaturated fatty acids are liberated by
alkaline hydrolysis or enzymatic cleavage.

11. The method according to any of claims 1 to 10, wherein the
compounds of the general formula I are present in the
transgenic plant at a content of at least 5% by weight, based
on the total fatty acids.

12. The process according to any of claims 1 to 11, wherein the
nucleic acid sequences which encode the polypeptides with
Δ6-desaturase activity, Δ6-elongase activity or Δ5-desaturase
activity are selected from the group consisting of:

a) a nucleic acid sequence with the sequence shown in SEQ ID
NO: 1, SEQ ID NO: 3, SEQ ID NO: 5, SEQ ID NO: 7, SEQ ID
NO: 9, SEQ ID NO: 11, SEQ ID NO: 13, SEQ ID NO: 15,
SEQ ID NO: 17, SEQ ID NO: 19, SEQ ID NO: 21, SEQ ID
NO: 23, SEQ ID NO: 25, SEQ ID NO: 27, SEQ ID NO: 29 or
SEQ ID NO: 31,

b) nucleic acid sequences which, owing to the degeneracy of
the genetic code, are obtained by back translation of the
amino acid sequences shown in SEQ ID NO: 2, SEQ ID NO: 4,
SEQ ID NO: 6, SEQ ID NO: 8, SEQ ID NO: 10, SEQ ID NO: 12,
SEQ ID NO: 14, SEQ ID NO: 16, SEQ ID NO: 18, SEQ ID
NO: 20, SEQ ID NO: 22, SEQ ID NO: 24, SEQ ID NO: 26,
SEQ ID NO: 28, SEQ ID NO: 30 or SEQ ID NO: 32,

c) derivatives of the nucleic acid sequences shown in SEQ ID
NO: 1, SEQ ID NO: 3, SEQ ID NO: 5, SEQ ID NO: 7, SEQ ID
NO: 9, SEQ ID NO: 11, SEQ ID NO: 13, SEQ ID NO: 15,
SEQ ID NO: 17, SEQ ID NO: 19, SEQ ID NO: 21, SEQ ID
NO: 23, SEQ ID NO: 25, SEQ ID NO: 27, SEQ ID NO: 29 or
SEQ ID NO: 31 which encode polypeptides with the amino
acid sequences shown in SEQ ID NO: 2, SEQ ID NO: 4,
SEQ ID NO: 6, SEQ ID NO: 8, SEQ ID NO: 10, SEQ ID NO: 12,
SEQ ID NO: 14, SEQ ID NO: 16, SEQ ID NO: 18, SEQ ID
NO: 20, SEQ ID NO: 22, SEQ ID NO: 24, SEQ ID NO: 26,
SEQ ID NO: 28, SEQ ID NO: 30 or SEQ ID NO: 32 and which
have at least 50% homology at the amino acid level,
without the enzymatic activity of the polypeptide being
substantially reduced.



13. The process according to any of claims 1 to 12, wherein the
nucleic acid sequences as claimed in claim 8 are linked with
one or more regulatory signals in a nucleic acid construct.

14. The method according to any of claims 1 to 13, wherein the
nucleic acid construct comprises additional biosynthetic
genes of the fatty acid or lipid metabolism selected from the
group consisting of acyl-CoA dehydrogenase(s), acyl-ACP
[= acyl carrier protein] desaturase(s), acyl-ACP
thioesterase(s), fatty acid acyl transferase(s), fatty acid
synthase(s), fatty acid hydroxylase(s), acetyl-coenzyme A
carboxylase(s), acyl-coenzyme A oxidase(s), fatty acid
desaturase(s), fatty acid acetylenases, lipoxygenases,
triacylglycerol lipases, allene oxide synthases,
hydroperoxide lyases or fatty acid elongase(s).
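
Claim 12 relies on two quantitative notions: the number of nucleic acid sequences that back-translate to one amino acid sequence (part b) and a percentage threshold at the amino acid level (part c). The Python sketch below illustrates both under simplifying assumptions that are not taken from the claims: homology is approximated as position-wise identity over two pre-aligned sequences of equal length, the standard genetic code is used, and the function names `count_back_translations` and `percent_identity` are hypothetical.

```python
from collections import Counter
from itertools import product
from math import prod

# Standard genetic code; codons ordered T, C, A, G, third base varying fastest.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Number of synonymous codons per amino acid (one-letter code), stops excluded.
CODONS_PER_RESIDUE = Counter(aa for aa in CODON_TABLE.values() if aa != "*")

def count_back_translations(protein):
    """Count the distinct coding sequences that encode a one-letter protein string."""
    return prod(CODONS_PER_RESIDUE[aa] for aa in protein)

def percent_identity(a, b):
    """Position-wise identity (in %) of two pre-aligned, equal-length sequences."""
    if not a or len(a) != len(b):
        raise ValueError("sequences must be non-empty and of equal length")
    matches = sum(x == y for x, y in zip(a, b))
    return 100.0 * matches / len(a)

# First four residues of SEQ ID NO: 49 (Met Ala Pro Asp) in one-letter code.
print(count_back_translations("MAPD"))
# A hypothetical single-substitution variant of the first eight residues.
print(percent_identity("MAPDADKL", "MAPDADRL") >= 50.0)
```

A real comparison would normally use an alignment program that handles gaps; the ungapped identity here only shows how a threshold of the kind recited in part c) can be evaluated.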



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